The rate and molecular spectrum of mutation are selectively maintained in yeast

Liu, Haoxuan; Zhang, Jianzhi

doi:10.1038/s41467-021-24364-6

Download PDF

Article
Open access
Published: 30 June 2021

The rate and molecular spectrum of mutation are selectively maintained in yeast

Nature Communications volume 12, Article number: 4044 (2021) Cite this article

6824 Accesses
10 Citations
5 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 04 April 2023

This article has been updated

Abstract

What determines the rate (μ) and molecular spectrum of mutation is a fundamental question. The prevailing hypothesis asserts that natural selection against deleterious mutations has pushed μ to the minimum achievable in the presence of genetic drift, or the drift barrier. Here we show that, contrasting this hypothesis, μ substantially exceeds the drift barrier in diverse organisms. Random mutation accumulation (MA) in yeast frequently reduces μ, and deleting the newly discovered mutator gene PSP2 nearly halves μ. These results, along with a comparison between the MA and natural yeast strains, demonstrate that μ is maintained above the drift barrier by stabilizing selection. Similar comparisons show that the mutation spectrum such as the universal AT mutational bias is not intrinsic but has been selectively preserved. These findings blur the separation of mutation from selection as distinct evolutionary forces but open the door to alleviating mutagenesis in various organisms by genome editing.

Changes in the distribution of fitness effects and adaptive mutational spectra following a single first step towards adaptation

Article Open access 31 August 2021

Dimitra Aggeli, Yuping Li & Gavin Sherlock

Synonymous mutations in representative yeast genes are mostly strongly non-neutral

Article 08 June 2022

Xukang Shen, Siliang Song, … Jianzhi Zhang

Genome architecture and stability in the Saccharomyces cerevisiae knockout collection

Article 11 September 2019

Fabio Puddu, Mareike Herzog, … Stephen P. Jackson

Introduction

Mutation is the ultimate source of all genetic variations, including those driving adaptations and those causing hereditary diseases. Therefore, the mutation rate per nucleotide per generation (μ) and its evolution are of broad relevance and interest. Because the vast majority of mutations are deleterious, Sturtevant famously asked in 1937 why μ has not been reduced by natural selection to zero¹. While he sighed that “no answer seems possible at present”, much progress has been made in the intervening years^{2,3,4,5,6,7,8,9,10,11,12,13}. It is now recognized that an organism’s μ is jointly determined by its genotype¹⁴ and environment¹⁵ and is subject to natural selection^6,12, and that the selection can arise from three factors: deleterious mutations, beneficial mutations, and the cost of fidelity^6,13. Deleterious mutations reduce organismal fitness, leading to the selective fixations of mutation rate modifiers that lower μ and a decrease of μ (Fig. 1a). By contrast, beneficial mutations raise organismal fitness, leading to the selective fixations of mutation rate modifiers that increase μ and an elevation of μ (Fig. 1a). In these two selections, the fitness effect of the modifier lies entirely in the mutations created and linked with the modifier, so the modifier is subject only to the so-called second-order selection. The cost of fidelity refers to the fitness cost due to the energy and time spent on proofreading, repair, and other biological processes that reduce μ. Hence, the cost of fidelity creates a first-order selection for mutation rate modifiers that increase μ, resulting in an uplift of μ (Fig. 1a). Therefore, a nonzero μ can result from a balance between the respective selections for lower and higher μ. We will refer to this answer to Sturtevant’s question as the conventional model.

**Fig. 1: Theoretical framework of mutation rate evolution and the study design.**

In the last decade, however, an alternative model termed the drift–barrier hypothesis (DBH) has emerged as the prevailing hypothesis of mutation rate evolution¹². The DBH considers only the first of the three selections aforementioned and regards the selections for higher μ negligible. The DBH posits that, as μ diminishes, the selective benefit of a given fractional reduction of μ also diminishes; eventually, the benefit becomes so weak relative to genetic drift that μ can no longer descend, especially when mutations are biased toward creating modifiers that increase μ. The minimal μ achievable by selection against deleterious mutations in the presence of drift is known as the drift barrier.

Which of these competing hypotheses provides the right answer to Sturtevant’s question? While a key prediction of the DBH has been validated (see “Discussion”), it is unknown whether the selections for higher μ, which have received empirical support (see “Discussion”), are so inconsequential that μ approaches the drift barrier. Because μ would be subject to stabilizing selection under the conventional model but directional selection under the DBH, it is possible to distinguish between them by evaluating the type of selection acting on μ. Another key aspect of mutation is its molecular spectrum, defined by the relative rates of different mutation types. Mutation spectrum could affect the severity of mutations¹⁶ and influence adaptation¹⁷, but whether the mutation spectrum itself is subject to natural selection is unknown.

In this work, we use the budding yeast Saccharomyces cerevisiae as a model to assess selections in the evolution of mutation rate and spectrum. We show that yeast’s mutation rate is maintained well above the drift barrier by stabilizing selection and that its mutation spectrum has also been shaped by selection.

Results

Mutagenesis frequently reduces μ

The type of selection acting on a trait can be inferred from its phenotypic change upon the removal of the selection. If μ has been selectively minimized to the drift barrier, evolution in the absence of selection should generally cause a rise in μ, although occasional small reductions in μ cannot be excluded¹⁸ (Fig. 1b). By contrast, if the mutation rate is selectively maintained well above the drift barrier, upon the removal of selection, the probabilities for μ to go up and go down are both substantial (Fig. 1c). Following this logic, we initiated 96 mutation accumulation (MA) lines from a commonly used laboratory yeast strain, in which the mismatch repair gene MSH2 had been deleted to speed up the accumulation of mutations (see “Methods”). Each MA line went through on average 1511 mitotic generations including 80 evenly spaced single-cell bottlenecks to minimize the effective population size (N_e) and selection; 93 lines survived the MA (Fig. 1d).

As expected, yeast growth significantly slowed after the MA; the average growth rate dropped by 41% (Supplementary Fig. 1a). Whole-genome sequencing (WGS) showed that, on average, each MA line accumulated 879 mutations, including 115 single nucleotide variants (SNVs) and 764 insertions/deletions (indels) (see “Methods”), consistent with the known mutation spectrum of MSH2-lacking strains¹⁹. An average of 186 genes was hit by mutations per MA line (Supplementary Fig. 1b, Supplementary Table 1, Supplementary Data 1). Consistent with the report that MSH2-lacking strains suffer from increased rates of indel mutations in homonucleotide runs²⁰, 78% of mutations in our MA lines were shared by at least two lines and almost all of the shared mutations were indels located in homonucleotide runs (see “Methods”).

Because the N_e of the MA lines was about 10 (see “Methods”) while most mutations are expected to have a fitness effect on the order of 1% or smaller²¹, selection should be infrequent during the MA and was indeed the case (see “Methods”). To assess the impact of MA on μ, we first inserted MSH2 back to the MA lines, which was accomplished in 60 of the 93 lines (Fig. 1d). Using the classic fluctuation test based on the reporter gene CAN1 (see “Methods”), we successfully measured μ in the progenitor as well as 49 of the above 60 MA lines, all carrying an intact MSH2 (Fig. 1d). We subsequently found that five of the 49 lines were likely diploidized upon the insertion of MSH2 (see the following paragraph) and excluded them (marked with stars in Fig. 2a) from all CAN1-based analyses. We found μ of the 44 remaining MA lines to range from 0.01 to 26 times that of the progenitor (Fig. 2a, Supplementary Data 1), including 19 lines with significantly higher μ and 13 lines with significantly lower μ than the progenitor (see “Methods”). Furthermore, 10 of the 13 lines with significantly decreased μ had μ reduced by at least 50%, while the remaining three lines had μ reduced by 40% to 43% (Fig. 2a). That over 40% of MA lines with significantly altered μ exhibit such drastic reductions in μ is inconsistent with the DBH, because when μ is near the drift barrier, mutations are expected to be strongly biased toward increasing μ and are not expected to cause such large reductions of μ so frequently (Fig. 1b). Our finding suggests that the progenitor’s μ is well above the drift barrier (Fig. 1c). We found a significant positive correlation between μ and the number of mutations accumulated during the MA, but μ was not significantly correlated with the growth rate of the MA line (Supplementary Fig. 2).

**Fig. 2: Mutation frequencies and rates of the MA lines.**

Because the above estimation of μ was based on loss-of-function mutations in one gene, we attempted to verify these results by performing another round of MA followed by WGS in 16 of the above 49 MA lines as well as the progenitor (and a diploid version of the progenitor), all with an intact MSH2 (Fig. 1d). 4–20 parallel lines were established from each strain, and on average 684 generations of MA were performed in the medium similar to that used in the fluctuation test (Supplementary Table 2, Supplementary Data 1, see “Methods”). Four of the 16 MA lines were apparently diploid, because the majority of the mutations observed in MA + WGS were in heterozygous state. Diploids should not produce mutant colonies in the fluctuation test. To be conservative in inferring mutation rate reductions in the first round of MA, we additionally regarded a line that was not subject to MA + WGS but had only two mutant colonies in the fluctuation test as putatively diploid (right most line marked with a star in Fig. 2a). We excluded these five diploid lines from all CAN1-based analyses. Because haploid and diploid progenitors showed similar mutation rates (Fig. 2b, c), all 16 lines with MA + WGS were included in analyses based solely on MA + WGS. The MA + WGS results were generally consistent with those from the fluctuation test. For instance, compared with the progenitor, all eight lines with higher CAN1-based μ exhibited higher MA + WGS-based SNV (Fig. 2b) or indel (Fig. 2c) rates. Among the four lines with lower CAN1-based μ, three exhibited significantly lower MA + WGS-based SNV or indel rates (Fig. 2a). CAN1-based μ is significantly correlated with both the MA + WGS-based SNV rate (r = 0.88, P = 8.9 × 10⁻⁵; Fig. 2d) and the MA + WGS-based indel rate (r = 0.78, P = 1.6 × 10⁻³; Fig. 2e), although the latter correlation is weaker than the former. This observation is not unexpected given that most loss-of-function mutations in CAN1 are SNVs instead of indels²².

Stabilizing selection of μ

The above analysis strongly suggests that μ is not selectively minimized to the drift barrier in the progenitor. To assess the selective forces acting on μ, we took advantage of published CAN1-based μ estimates from seven natural yeast strains of diverse origins¹⁴ (Supplementary Table 3, Supplementary Fig. 3). For a neutrally evolving trait, the ratio of its genetic variance among natural strains of a species (V_g) to the mutational variance generated by mutations per generation (V_m) is expected to equal 4N_e in primarily asexual diploids such as S. cerevisiae²³. By contrast, stabilizing selection would reduce V_g and render V_g/V_m smaller than 4N_e. Because μ is not normally distributed among the MA lines, we first log₁₀-transformed μ (Fig. 2a) before computing V_m and V_g, although the results are not qualitatively different without the transformation (Supplementary Table 4). We estimated V_g of μ from the seven natural strains. To estimate V_m that is comparable with V_g, we used the CAN1-based μ estimates from the 44 haploid MA lines, but corrected for the increased mutagenesis in the MA induced by deleting MSH2. We employed three corrections by respectively assuming that deleting MSH2 caused the same fold change in the rate of each mutation type as the observed fold change of the total rate of SNVs and indels (V_m1), as that of indels (V_m2), and as that of SNVs (V_m3). Because deleting MSH2 increased the indel rate much more than increasing the SNV rate (see “Methods”), among the three V_m values, V_m1 is probably the closest to the truth, while V_m2 is underestimated and V_m3 is overestimated. Hence, V_m3 and V_m2 allow determining the lower and upper bounds of V_g/V_m, respectively.

We found V_g/V_m to be at least 540 times lower than the neutral expectation of 4N_e ≈ 4 × 10⁷ (see “Methods”), regardless of the particular V_m used (Table 1), indicating strong stabilizing selection of μ. This signal of stabilizing selection is not an artifact of the physical limits of μ, because the range of μ among the natural strains is even smaller than that of the MA lines (Fig. 2a). To investigate whether the stabilizing selection prohibits the evolution of higher μ, lower μ, or both, we separated V_m into two components that respectively measure the variance of μ created by mutations decreasing μ (V_mL) and increasing μ (V_mH). If there is no selection against a reduction in μ, V_g should be at least as large as 4N_eV_mL. However, we found V_g to be at least 300 times lower than 4N_eV_mL (Table 1), indicating the action of selection prohibiting a reduction of μ in evolution. Similarly, V_g was at least 230 times lower than 4N_eV_mH (Table 1), indicating the action of selection prohibiting a rise of μ in evolution. In the above tests, the smallest difference observed between V_g and a neutral expectation was 230 times, based on V_m2 that corresponds to a conservative test. Therefore, it is exceedingly unlikely that our test results are due to confounding factors such as mutation spectrum differences between wild-type and MSH2-lacking strains or the inaccuracies of V_m, V_g and N_e estimates (see “Methods”). Together, the above results demonstrate that μ has been selectively maintained at an intermediate level in S. cerevisiae Note that the selective forces to increase and to suppress μ are not equally strong, because the mean μ of the MA lines is higher than the progenitor (P = 4.3 × 10⁻³ for CAN1-based μ, t-test; P = 5.7 × 10⁻⁷ for WGS-based SNV rate, t-test).

Table 1 Test of stabilizing selection of the mutation rate in yeast.

Full size table

To examine if the above finding extends beyond the species concerned, we examined the evolution of μ in the divergence between S. cerevisiae and its sister species S. paradoxus. S. paradoxus’ SNV rate was recently estimated by MA + WGS to be 7.27 × 10⁻¹¹ per site per generation²⁴, about one third that of S. cerevisiae¹⁵. Under neutral evolution, the squared difference in mutation rate between the two species (D²) should equal V_m times the number of generations separating the two species (T)²⁵, which we have estimated to be 2.89 × 10⁹ (see “Methods”). We obtained V_m based on the 16 MA lines with MA + WGS-based estimates of SNV rates and corrected the impact of deleting MSH2 as in the above analysis. We found D²/V_m to be at least 4000 times smaller than T (Table 1). We also respectively estimated V_mL and V_mH using the 16 MA lines with MA + WGS-based estimates of μ. Again, we found D²/V_mL to be at least 400 times smaller and D²/V_mH at least 2400 times smaller than T (Table 1), demonstrating selection against lowering as well as increasing μ in the divergence of Saccharomyces species. The above conclusion holds irrespective of whether the SNV rates are log₁₀-transformed (Table 1) or not (Supplementary Table 4).

μ is well above the drift barrier in diverse organisms

To directly confirm that μ is maintained above the drift barrier, we estimated the drift barriers for a diverse set of organisms including S. cerevisiae (see “Methods”). The drift barrier is commonly considered in terms of the number of mutations per functional genome per generation (U), which equals μG, where G is the size of the functional genome or the number of nucleotides where mutations would be subject to selection. Although the drift barrier varies by the parameters assumed, our estimates were based on the best available information that was also used in the formulation of the BDH¹⁸. In every species examined, U is substantially higher than the drift barrier, often by one to several orders of magnitude (Table 2). For example, S. cerevisiae’s U is over 3000 times the estimated drift barrier.

Table 2 Observed SNV mutation rates per functional genome per generation and the corresponding drift barriers of model organisms.

Full size table

Note that estimating the drift barrier requires knowing N_e, which is typically inferred from the synonymous nucleotide diversity under the assumption that synonymous mutations are neutral (see “Methods”). If synonymous mutations are overall slightly deleterious as has been suggested²⁶, N_e would have been underestimated and drift barrier overestimated, rendering the true difference between the observed U and the drift barrier even larger than that shown in Table 2. In other words, our conclusion based on Table 2 is conservative.

Discovery of PSP2 as a mutator gene

Antimutator genes lower μ, so studying them helps understand the molecular basis of high replication fidelity. About 30 antimutator genes are known in S. cerevisiae²⁷. By contrast, mutator genes, whose normal functions are to increase μ, have not been reported. Note the distinction between mutator genes and mutator alleles, the latter being loss-of-function alleles of antimutator genes. That μ is substantially lower in some MA lines than the progenitor suggests the existence of mutator genes that were crippled in MA; their discovery would help understand the mechanisms of mutagenesis and μ regulation. We originally identified candidate mutator genes by screening genes that were more frequently mutated in low-μ lines than in high-μ lines among MA lines with CAN1-based μ estimates (Supplementary Table 5), and picked four candidates (RAD9, YFL013W-A, PSP2, and MSH4) based on their ranks from the screening and annotated functions for a follow-up study. We subsequently found that some MA lines had erroneous MSH2, so reinserted MSH2 followed by re-estimation of CAN1-based μ. Based on these new estimates of μ (Fig. 2a), the four genes are ranked 1362, 911, 58, and 76, respectively. We respectively knocked out these four genes in the progenitor and measured the CAN1-based μ. We found the PSP2-lacking strain to exhibit a significantly reduced μ (P < 0.05; Fig. 3a) and confirmed it by additional replications (P = 0.0095, Wilcoxon rank-sum test; Fig. 3b). That removing PSP2 reduces μ by 42% (Fig. 3b) suggests that it is a major mutator gene. PSP2 (polymerase suppressor 2) was originally discovered from a screening of rescuers of heat-sensitive mutations in POL1 and POL3, which encode the catalytic subunit of DNA polymerase I and δ, respectively²⁸. PSP2 is an RNA-binding protein and promotes P-body assembly^29,30. Under nitrogen starvation, PSP2 binds to the mRNAs of ATG1 and ATG13 to promote their translation and autophagy; deleting PSP2 reduces the synthesis of ATG1 and ATG13, autophagy activity, and cell survival³¹. Yeast grew faster under some conditions but slower under other conditions upon PSP2 deletion³² (Fig. 3c). Note that removing PSP2 reduced μ in a medium similar to SC (synthetic complete) (Fig. 3b), where the knockout slowed yeast growth (Fig. 3c), but the causal relationship between the slowed growth and reduced μ is unclear.

PSP2 can be divided into three segments: the N-terminal segment has unknown functions, the middle segment interacts with translation initiation factors, and the C-terminal segment harbors four RGG motifs and binds to RNAs (Fig. 3d); the middle and C-terminal segments are required for PSP2’s role in autophagy³¹. We respectively deleted from the progenitor the DNA sequences corresponding to the three segments of PSP2, followed by CAN1-based μ estimation. We found that the N-terminal and C-terminal segments but not the middle segment are required for PSP2’s activity in increasing μ (Fig. 3e), suggesting that PSP2 regulates μ through RNA binding but not protein interaction.

Mutation spectrum has been shaped by selection

To investigate the potential role of natural selection in shaping yeast’s mutation spectrum, we compared the variance (V_g) in a component of the mutation spectrum among five divergent natural yeast strains having published MA + WGS data (Supplementary Fig. 3), with the corresponding mutational variance per generation estimated from the 16 MA lines with MA + WGS data. Because haploid and diploid progenitors show similar mutational spectrums (Fig. 4), we analyze the MA lines and natural strains regardless of their ploidy. Even under the most generous calculation, V_g/V_m (3.07 × 10⁴) of the proportion of mutations that are SNVs is orders of magnitude smaller than the neutral expectation of 4 × 10⁷ (Supplementary Table 6). In fact, the variance of the proportion of SNVs is smaller among the five natural strains than among the 16 MA lines (Fig. 4a), despite that the numbers of generations separating the natural strains are much greater than those separating the MA lines even after the correction for the increased mutagenesis of MA lines induced by deleting MSH2. Similar results were found regarding the proportion of insertions (maximal V_g/V_m = 3.57 × 10³) and that of deletions (maximal V_g/V_m = 3.27 × 10⁴) (Fig. 4a, Supplementary Table 6). Thus, the fractions of SNVs, insertions, and deletions among all mutations have been under stabilizing selection. Note, however, that because the three fractions must add to 1, the three fractions may not be subject to three separate stabilizing selections.

**Fig. 4: Molecular spectra of mutations in 16 MA lines and 5 natural yeast strains estimated by MA + WGS.**

There are six different types of SNVs (Fig. 4b) and we found evidence for stabilizing selections on each of the six fractions (maximal V_g/V_m ranging between 3.03 × 10⁴ and 1.35 × 10⁶; Supplementary Table 6). Again, there may not be six separate stabilizing selections because the six fractions must add up to 1.

Two mutational biases are of special interest because of their universal presence across the tree of life. The first is the transition/transversion (Ts/Tv) bias. Transitions are changes between purines or between pyrimidines, whereas transversions are changes between a purine and a pyrimidine. In almost all species examined, the mutational Ts/Tv ratio exceeds 0.5, the random expectation³³. We found evidence for stabilizing selection of Ts/Tv (Fig. 4c); the maximal V_g/V_m equals 9.13 × 10⁴ (Supplementary Table 6). In particular, Ts/Tv is significantly higher (or lower) than that of the progenitor in two (or zero) MA lines (Fig. 4c). Hence, the stabilizing selection appears to have mainly kept the mutational Ts/Tv ratio low.

The second bias, known as the AT mutational bias, refers to the observation that GC → AT mutations outnumber AT → GC mutations. The universality of this bias across all species examined has led to the belief that it arises from the chemical nature of DNA irrespective of variations in replication and repair mechanisms³⁴. We found that, in one MA line, the ratio of the number of GC → AT mutations to the number of AT → GC mutations is significantly different from that in the progenitor, and is reversed from >1 to <1 (Fig. 4d). Clearly, the AT mutational bias is subject to genetic control and is not a chemical necessity. Furthermore, V_g/V_m for the AT mutational bias is at least 120 times lower than the neutral expectation (Supplementary Table 6), indicating that the bias has been maintained by stabilizing selection. Stabilizing selection on mutation spectrum is evident regardless of whether we log₁₀-transform the original trait values (Supplementary Table 7) or not (Supplementary Table 6).

Discussion

Consistent with the prediction of the DBH, a strong negative correlation between U and N_e was previously observed across diverse organisms¹², but several considerations suggest that the actual correlation is likely substantially weaker³⁵ (see “Methods”). Regardless, our finding in yeast that μ is selectively maintained well above the drift barrier refutes the DBH and suggests the presence of the first- and/or second-order selection for higher μ (Fig. 1a). Indeed, experiments in Escherichia coli found that genotypes with relatively high μ often outcompete those with relatively low μ in 100 generations of evolution despite their lack of difference in fitness prior to the evolution³⁶. Similarly, a S. cerevisiae mutator strain surpassed the wild-type in ~250 generations of evolution in large but not small co-cultures³⁷. In Lenski’s 50,000-generation E. coli experimental evolution in a low-glucose environment, 6 of 12 populations increased in μ and they adapted faster than the other 6 populations³⁸. These observations support that modifiers raising μ can be fixed as a result of second-order selection. Furthermore, under certain conditions, the optimal U resulting from the two opposing second-order selections (Fig. 1a) is predicted to decline with N_e in asexuals (Supplementary Fig. 4, see “Methods”), which could partially explain the reported negative correlation between U and N_e¹². Nonetheless, the predicted optimal U given N_e appears lower than the corresponding observed U (Supplementary Fig. 4). Furthermore, even in asexuals, the second-order selection for higher μ is episodic depending on the environment and the frequency of beneficial mutations, so μ likely fluctuates under its influence^7,8,11. In sexuals, this selection is expected to be ineffective, because the mutation rate modifier becomes quickly unlinked with the beneficial mutation created and loses its selective advantage^6,9.

Experiments in several viruses have discovered a tradeoff between the speed and fidelity of genome replication^39,40, providing direct evidence for a cost of fidelity. We found that PSP2, a positive regulator of yeast autophagy, increases μ and that the autophagy-promoting and μ-increasing activities both rely on PSP2’s RNA-binding domain. While the mechanistic connection between these two activities is unclear, our finding suggests that the cost of fidelity could result from pleiotropy⁴¹; deleting PSP2 increases fidelity (Fig. 3b) but impairs autophagy and slows cell growth³¹. It is, however, unknown whether the fitness cost of fidelity is correlated with N_e so could potentially create a negative correlation between U and N_e, which seems possible at least in multicellulars. For example, N_e is about ten times higher in mouse than in human (Table 2). There are fewer germ cell divisions per generation in mouse than in human, potentially rendering the demand and thus the cost of fidelity per cell division lower in mouse than in human; consistently, mutation rate per cell division is higher in mouse than in human⁴². Consequently, the fitness cost of fidelity per generation is also lower in mouse than in human, predicting a lower μ in mouse than in human (Fig. 1a), as is observed⁴². If the trend in human and mouse is generalizable, which seems plausible given the correlations between N_e and many life-history traits, μ would generally decrease with N_e. Furthermore, G is strongly negatively correlated with N_e (see “Methods”). So, under the above model, U would have a rather strong negative correlation with N_e. Hence, the DBH is not the only hypothesis that could explain a strong negative correlation between U and N_e if this correlation truly exists (see “Methods”).

Our data do not allow us to distinguish between the first-order and second-order selections that lift μ from the drift barrier. Considering past theoretical results^7,8,9,11, we suggest that the first-order selection is more likely responsible for a μ that is stably above the drift barrier, while the second-order selection may further raise it episodically. While our experiments focused on yeast, the finding of a higher U than the drift barrier across diverse organisms (Table 2) suggests that our conclusion is likely general. That the observed mutation rates of natural organisms are orders of magnitude above the theoretical minimums suggests the possibility of lowering their mutation rates through genome editing, which would have both theoretical and practical values.

Our data also provide evidence that the molecular spectrum of mutation has been selectively shaped, but it is unknown whether the selections directly or indirectly act on the spectrum and what the selective agents are. Further studies are needed to answer these questions. Mutation and selection are generally considered distinct evolutionary forces. Our finding that both the rate and spectrum of mutation are determined by actions of natural selection somewhat blurs the separation of mutation from selection, which may offer new insights into evolution.

Methods

Strains and genetic manipulations

We knocked out the MSH2 gene from the haploid BY4741 strain of S. cerevisiae, referred to as the progenitor, by homologous recombination with KanMX, followed by selection on YPD (1% yeast extract, 2% peptone, and 2% dextrose) plates with 0.5 g/L G418. The knockout of MSH2 was confirmed by Sanger sequencing. This MSH2-lacking strain was used to initiate the first round of MA. Upon the completion of the MA, MSH2 was inserted back to the resultant strains using CRISPR-Cas9 genome editing⁴³. Specifically, the wild-type MSH2 from BY4741 was used as the repair fragment, and three different guide RNAs targeting KanMX were used. Transformation was performed in each MA line for up to three times and was confirmed by Sanger sequencing of the reinserted locus. Restoration of MSH2 was successful in only 60 of the 93 MA lines, probably due to reduced transformation efficiencies in the MA lines.

Four candidate mutator genes (RAD9, YFL013W-A, PSP2, and MSH4) and three segments of PSP2 were also respectively removed from the progenitor (with intact MSH2) using CRISPR-Cas9. The start (or stop) codon was left unchanged when we removed the DNA of the N-terminal (or C-terminal) segment of PSP2. Primers used in this study can be found in Supplementary Table 8.

MA and whole-genome sequencing

The strategy of two rounds of MA was previously used in animals to probe the change of μ as a result of MA^44,45. In the first round of MA in our study, 96 parallel lines were established from the BY4741 without MSH2. Cells were propagated at 30 °C on YPD plates. A single-cell bottleneck was applied to each line every 48 h, where a randomly picked average-size colony was streaked onto a new plate. Each line went through a total of 80 bottlenecks, and 93 of the 96 lines survived in the end. The total number of generations each MA line went through was estimated by the number of generations between bottlenecks multiplied by the number of bottlenecks. We estimated the number of generations between bottlenecks by counting the number of cells in an average-size colony and assuming exponential growth, and took the average of the estimates prior to and after MA. The N_e of an MA line equals the harmonic mean of the number of cells per generation. In our experiment, the average between-bottleneck number of generations is 21, so N_e equals 21/(1/1 + 1/2 + 1/2² + …+ 1/2²¹) ≈10. The genomes of the 93 MA lines and the progenitor in the MSH2-lacking background were sequenced.

A total of 18 strains, including 16 of the above 93 MA lines, BY4741 (haploid), and BY4743 (diploid), all with intact MSH2, were subject to the second round of MA. Four to 20 replicate lines were established for each strain. Cells were propagated at 30 °C on SC (synthetic complete) plates, similar to that used in the fluctuation test. The total time in the second round of MA for all lines was kept at ~100 days and the number of generations between bottlenecks was kept at ~20. The between-bottleneck duration was different among these 18 strains because of their different generation times. It was 48 h in BY4741, BY4743, MA28, and MA38, 72 h in MA15, MA21, MA23, MA25, MA29, MA33, MA44, MA63 and MA92, and 96 h in MA45, MA51, MA56, MA64, and MA94. The genomes of 209 MA lines at the end of the second round of MA and their 18 ancestral strains were sequenced. The number of generations each MA line went through was estimated in the same way as in the first round of MA.

For each sample to be sequenced, the genomic DNA was extracted using MasterPure Yeast DNA Purification Kit (Lucigen; Cat. No. MPY80200). Library was constructed using Nextera DNA Flex Library Prep kit (Illumina; Cat. No. 20018705). Paired-end reads (2 × 150 bases) were generated on Illumina Hiseq 4000 platform by Admera Health (www.admerahealth.com).

Identification of mutations and verification by Sanger sequencing

Sequencing reads from each sample were first mapped to the S. cerevisiae reference genome (version R64-2-1) by Burrows-Wheeler Aligner⁴⁶. Duplicate marking and local realignment around indels were carried out using Genome Analysis Toolkit (GATK)⁴⁷. SNVs and indels shorter than 50 nucleotides were called by GATK HaplotypeCaller. Variants that differ between each MA line and its ancestral strain were retained when they met the following criteria: (i) a variant must be homogeneous because the MA lines in this study were haploid, (ii) a variant site must be covered by at least five reads in both the MA line and the ancestor, (iii) a variant must be supported by both forward and reverse reads, and (iv) a variant must have a quality score no lower than 50. Mutation rate was computed by (number of mutations in a sample)/(number of callable sites)/(number of generations in MA), where callable sites were defined as genomic sites covered by at least five reads.

Twenty shared mutations between MA lines identified in the first round of MA were randomly chosen for verification by Sanger sequencing. For each mutation, Sanger sequencing was performed in both the sample with the mutation and its ancestor, and the mutation was considered confirmed by Sanger sequencing if both results agreed with the results from Illumina sequencing. Polymerase chain reaction and Sanger sequencing were successful in 16 of the 20 cases, and 15 of the 16 mutations were confirmed by Sanger sequencing.

The impact of deleting MSH2 on mutation rates were assessed by comparing the mutations accumulated in the first round of MA in the MSH2-lacking background with those in the second round of MA of the progenitor with intact MSH2. Deleting MSH2 increased the SNV rate per site per generation by 16 times, indel rate per site per generation by 580 times, and the total rate of SNVs and indels by 104 times.

Confirmation of the infrequency of selection in the first round of MA

Because the N_e of the MA lines was about 10, while most mutations are expected to have a fitness effect on the order of 1% or smaller²¹, selection should be infrequent during the MA. These infrequent selections likely concentrated in the one-sixth of yeast genes known as essential genes, because loss-of-function mutations in essential genes cannot accumulate. To confirm the infrequency of selection, we compared the genomic distributions of the observed mutations in the 93 MA lines with the corresponding random expectations. The fraction of SNVs located in genic regions is 73%, slightly but significantly below the random expectation of 74% (P = 0.0046, binomial test). The fraction of coding SNVs that are nonsynonymous is 70%, also slightly but significantly below the random expectation of 76% (P < 0.001, binomial test). While 16% of all homonucleotide runs reside in genic regions, a slightly lower fraction of indels (14%) occurred in genic regions (P = 0.002, chi-squared test). Together, these results confirm that selection was present but infrequent in our MA experiment. The infrequent selection may cause a slight underestimation of V_m (under both the DBH and conventional model), rendering our inference of stabilizing selection more conservative.

Fluctuation test

CAN1-based fluctuation test was performed following Lang⁴⁸ with a few modifications. CAN1 encodes an arginine transporter; cells must carry loss-of-function mutations in CAN1 to be able to grow in the presence of canavanine, a toxic arginine analog. The strain being tested was precultured in SC-Arg liquid medium for 48 h. Cells were then diluted and transferred to a 96-well plate with an initial cell number of ~1000 per well. Each well in the 96-well plate contained 100 μl fresh SC-Arg medium. The plate was sealed with an aluminum film and incubated at 30 °C with shaking for 72 h. Then, 72 100-μl cultures were spot-plated onto the selection plates (SC-Arg with 60 μg/ml canavanine), while the remaining 24 100-μl cultures were pooled followed by cell counting by a hemocytometer. About 1000 cells from this pool were plated onto a SC-Arg plate (without canavanine) to test plating efficiency. The canavanine plates with cell cultures were first dried in a sterile hood, followed by incubation at 30 °C for 72 h. Finally, the mutant colonies on each plate were manually counted. Because the growth rates of the MA lines were low, the incubation time in this step was longer than the usually used 49 h. We subjected all 60 MA lines with reinserted MSH2 to the fluctuation test, but only 49 of them grew in the medium. We also subjected the progenitor (before the deletion of MSH2) to the fluctuation test.

CAN1-based μ was estimated by bz-rates⁴⁹, a web tool that uses an empirical probability generating function to estimate the number of mutations per culture⁵⁰ with correction for plating efficiency. The mutation frequency presented is the probability of loss-of-function mutation in CAN1 per cell division. No CAN1 mutant was observed in two MA lines, and their μ values were calculated by assuming the observation of one mutant colony to allow plotting μ in a logarithmic scale. The same practice was employed in estimating V_m, which rendered our selection tests more conservative. The number of mutants per culture in the fluctuation test follows the Luria–Delbrück distribution, which is a highly skewed, non-normal distribution⁵⁰. To statistically evaluate the difference in μ between two strains, we used the 95% confidence intervals (CI) of μ estimated by bz-rates⁴⁹; two strains with non-overlapping 95% CIs were regarded as having significantly different μ.

N_e of S. cerevisiae

The SNV rate of S. cerevisiae was estimated from MA lines to be μ = 1.95 × 10⁻¹⁰ per site per generation in YPD¹⁵. A species-wide population genomic survey of S. cerevisiae⁵¹ found that the nucleotide diversity per site (π) is substantially lower at nonsynonymous (0.0014), intronic (0.0027), and intergenic (0.0037) sites than at synonymous sites (0.0091). Under the assumption that synonymous mutations are neutral, N_e was estimated by π_S/(4μ) = 1.17 × 10⁷. It is possible that π_S is smaller than the neutral nucleotide diversity because of selection at synonymous sites, which renders our estimate of N_e smaller than its true value and our inference of stabilizing selections of mutation rates and spectrum conservative.

Estimation of V _m, V _g, and D ²

V_m of a phenotypic trait such as μ is the variance of μ among MA lines per generation. V_mL (or V_mH) is the corresponding variance calculated using only MA lines with lower (or higher) μ than that of the progenitor. Because μ is higher in the MSH2-lacking MA lines than in natural strains, we employed three corrections by respectively assuming that deleting MSH2 caused the same fold change in the rate of each mutation type as the observed fold change of the total rate of SNVs and indels (V_m1), as that of indels (V_m2), and as that of SNVs (V_m3). Specifically, the corrected numbers of generations became 119,850 in estimating V_m1, 667,636 in estimating V_m2, and 18,578 in estimating V_m3, respectively. In the top half of Table 1, V_m, V_mL, and V_mH were estimated using the CAN1-based μ of MA lines from the present study, while V_g was estimated using published CAN1-based μ of seven diverse natural strains of S. cerevisiae¹⁴ (Supplementary Table 3, Supplementary Fig. 3). In the bottom half of Table 1, V_m, V_mL, and V_mH were estimated using the MA + WGS-based SNV rates of MA lines from the present study, while D² was the squared difference in MA + WGS-based SNV mutation rate between S. cerevisiae¹⁵ and S. paradoxus²⁴. In the above analysis, we either log₁₀-transformed μ before computing V_m, V_g, and D² (Table 1) or used the original μ values without transformation (Supplementary Table 4).

In Supplementary Table 6, V_m was estimated using the MA + WGS-based mutation rates of MA lines from the present study, while V_g was estimated using published MA + WGS-based mutation rates of five diverse natural strains of S. cerevisiae^19,24,52,53 (Supplementary Fig. 3). We presented results from both log₁₀-transformed values (Supplementary Table 7) and untransformed values (Supplementary Table 6).

To test if V_g/V_m is significantly smaller than the neutral expectation, we bootstrapped MA lines as well as natural strains 10,000 times; P-value is the fraction of times when V_g/V_m computed from a bootstrap sample exceeds the neutral expectation. To test if D²/V_m is significantly smaller than the neutral expectation, we bootstrapped MA lines 10,000 times; P value is the fraction of times when D²/V_m computed from a bootstrap sample exceeds the neutral expectation. The V_g and V_m calculated here are phenotypic variances, including the genetic component and estimation error, because the environment is fixed. The phenotypic variance caused by estimation error should be similar for natural strains and MA lines because of the use of the same phenotyping method. Because the phenotypic variance is greater for MA lines than for natural strains, the fraction of phenotypic variance contributed by genetics is greater for MA lines than for natural strains. Hence, V_g/V_m is overestimated when computed using phenotypic variance instead of genetic variance, which renders our conclusion that V_g/V_m is smaller than the neutral expectation conservative.

Number of generations separating S. cerevisiae and S. paradoxus

The number of generations separating S. cerevisiae and S. paradoxus was estimated by dividing the nucleotide sequence divergence between the two species per synonymous site (d_S) by the mean SNV mutation rate of S. cerevisiae and S. paradoxus. Here, d_S has been estimated to be 0.3868⁵⁴ and the reported SNV mutation rates in these two species are 1.95 × 10⁻¹⁰ and 7.27 × 10⁻¹¹ per site per generation, respectively^15,24. Hence, the number of generations separating the two species is 2.89 × 10⁹. As noted above, d_S may underestimate the neutral divergence between the two species, which renders our stabilizing selection conclusion conservative.

The mutation rate drift barriers of various species

For (haploid) asexuals, the drift barrier (U₀) is reached when the mutation rate reduction per functional genome per generation by a modifier (ΔU = λU₀) equals 1/N_e, where λ is the fractional reduction of the mutation rate and is assumed to be 0.1¹⁸. Hence, the drift barrier is U₀ = 10/N_e. For diploid asexuals, because ΔU = 2λU₀, U₀ = 5/N_e. Let Δm be the per generation rate of mutational production of modifiers decreasing the mutation rate minus the corresponding rate of production of modifiers increasing the mutation rate. When U approaches U₀, Δm is likely negative, which will cause an increase in U₀¹⁸. However, because the magnitude of Δm is expected to be much smaller than U₀¹⁸, the effect of Δm on U₀ is negligible.

For diploid sexuals at the drift barrier, ΔU = 2λU₀ = 1/(2N_es), where s is the mean selective disadvantage of deleterious mutations in the heterozygous state¹⁸ and is assumed to be 0.01²¹. So, U₀ = 1/(4N_esλ) = 250/N_e. In sexuals, the effect of Δm on U₀ is amplified by 1/s times, so it is possible that U₀ is increased to some extent due to a negative Δm. However, it is extremely unlikely that the absolute value of Δm/s = 100Δm exceeds 5U₀, because it is difficult to imagine that more than 5% of mutations create mutation rate modifiers. Therefore, U₀ should not exceed 6 times the above estimate, or 1500/N_e.

We obtained the N_e estimates of E. coli⁵⁵, Bacillus subtilis¹², Schizosaccharomyces pombe⁵⁶, Chlamydomonas reinhardtii⁵⁷, Arabidopsis thaliana⁵⁸, Drosophila melanogaster⁵⁹, Mus musculus⁶⁰, and Homo sapiens⁶¹ from the literature. The N_e of S. cerevisiae was assumed to be 10⁷, as estimated above. S. cerevisiae is considered a diploid asexual because it is normally a diploid but undergoes rare sexual reproduction (once per 1000 generations) in the wild⁶². S. pombe is considered a haploid asexual because it is normally a haploid and very rarely undergoes sexual reproduction (once per 0.6 million generations) in the wild⁶². Arabidopsis thaliana has become largely selfing since about 1 million years ago⁶³, but we here still consider it outcrossing so that our argument for a U that is higher than the drift barrier is more conservative. Chlamydomonas reinhardtii is normally a haploid, but how often it undergoes sexual reproduction in nature is unknown⁶⁴. To be conservative in our argument, we considered it a sexual haploid.

The observed U is measured by the number of SNV mutations per functional genome per generation. The functional genome size G is the number of nucleotides in the genome that are subject to natural selection. To be conservative, G was assumed to equal the total coding sequence length except for D. melanogaster, M. musculus, and H. sapiens. For these three species, G was estimated by the total number of nucleotides in autosomes multiplied by the faction of sites under purifying selection, which was previously estimated to be 48.7% for D. melanogaster⁶⁵, 7.2% for M. musculus⁶⁵, and 8.2% for H. sapiens⁶⁶, respectively. We obtained the estimates of SNV mutation rate per site per generation of E. coli⁶⁷, B. subtilis⁶⁸, S. cerevisiae¹⁵, S. pombe⁶⁹, C. reinhardtii⁷⁰, A. thaliana⁷¹, D. melanogaster⁷², M. musculus⁴², and H. sapiens⁴² from the literature.

On the negative correlation between U and N _e

For three reasons, the cause of the reported¹² negative correlation between U and N_e is uncertain. First, N_e is typically estimated by dividing the neutral nucleotide diversity by kμ, where k is 4 for diploids and 2 for haploids, and the neutral nucleotide diversity is typically approximated by the synonymous nucleotide diversity π_S. Hence, any estimation error of μ influences the estimates of U and N_e in opposite directions, creating a spurious negative correlation between them³⁵. Second, synonymous mutations are not immune to selection²⁶. Due to the abundance of negative selection and the increase of selection intensity with N_e, it is possible that the neutral nucleotide diversity is underestimated by π_S and that the extent of the underestimation rises with N_e. In other words, the larger the N_e, the greater the underestimation of N_e, which increases the apparent effect of N_e on U. Finally, the negative correlation between U and N_e is partially due to the negative correlation between G and N_e. For instance, log₁₀N_e has a linear correlation coefficient of −0.61 (P = 4.0 × 10⁻⁴) with log₁₀(proteome size) and −0.77 (P = 1.1 × 10⁻⁶) with log₁₀(genome size) when analyzed using published data¹².

Optimal U under the two opposing second-order selections in asexuals

Raising U increases the number of beneficial mutations as well as that of deleterious mutations. Orr found that, in asexuals, the optimal U, which is the U value corresponding to the highest speed of adaptation, is the harmonic mean of the coefficient of selection (s > 0) against deleterious mutations⁷. Because deleterious mutations with s smaller than 1/N_e are effectively neutral, the range of s for mutations that are selected against enlarges with N_e, which causes a decrease of the harmonic mean of s and hence the optimal U with N_e. To show this trend numerically, we sampled 100,000 s values from a gamma distribution with mean equal to 0.01 and shape parameter α = 0.1, 0.2, or 0.5. The harmonic mean of s among all sampled s values that are larger than 1/N_e was computed. This harmonic mean is the optimal U. We considered various N_e values and various α values.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The sequencing reads generated have been deposited to NCBI SRA under the accession number PRJNA735524. All other data are presented in the paper and associated supplementary materials. Source data are provided with this paper.

Change history

04 April 2023
A Correction to this paper has been published: https://doi.org/10.1038/s41467-023-37354-7

References

Sturtevant, A. H. Essays on evolution. I. On the effects of selection on mutation rate. Q. Rev. Biol. 12, 467–477 (1937).
Article Google Scholar
Kimura, M. On the evolutionary adjustment of spontaneous mutation rates. Genet. Res. 9, 23–34 (1967).
Article ADS Google Scholar
Leigh, E. G. Jr. Natural selection and mutability. Am. Nat. 104, 301–305 (1970).
Article Google Scholar
Kondrashov, A. S. Modifiers of mutation-selection balance: general approach and the evolution of mutation rates. Genet. Res. 66, 53–69 (1995).
Article Google Scholar
Drake, J. W., Charlesworth, B., Charlesworth, D. & Crow, J. F. Rates of spontaneous mutation. Genetics 148, 1667–1686 (1998).
Article CAS PubMed PubMed Central Google Scholar
Sniegowski, P. D., Gerrish, P. J., Johnson, T. & Shaver, A. The evolution of mutation rates: separating causes from consequences. Bioessays 22, 1057–1066 (2000).
Article CAS PubMed Google Scholar
Orr, H. A. The rate of adaptation in asexuals. Genetics 155, 961–968 (2000).
Article CAS PubMed PubMed Central Google Scholar
Andre, J. B. & Godelle, B. The evolution of mutation rate in finite asexual populations. Genetics 172, 611–626 (2006).
Article CAS PubMed PubMed Central Google Scholar
Johnson, T. Beneficial mutations, hitchhiking and the evolution of mutation rates in sexual populations. Genetics 151, 1621–1631 (1999).
Article CAS PubMed PubMed Central Google Scholar
Lynch, M. The cellular, developmental and population-genetic determinants of mutation-rate evolution. Genetics 180, 933–943 (2008).
Article PubMed PubMed Central Google Scholar
Taddei, F. et al. Role of mutator alleles in adaptive evolution. Nature 387, 700–702 (1997).
Article ADS CAS PubMed Google Scholar
Lynch, M. et al. Genetic drift, selection and the evolution of the mutation rate. Nat. Rev. Genet. 17, 704–714 (2016).
Article CAS PubMed Google Scholar
Baer, C. F., Miyamoto, M. M. & Denver, D. R. Mutation rate variation in multicellular eukaryotes: causes and consequences. Nat. Rev. Genet. 8, 619–631 (2007).
Article CAS PubMed Google Scholar
Gou, L., Bloom, J. S. & Kruglyak, L. The genetic basis of mutation rate variation in yeast. Genetics 211, 731–740 (2019).
Article CAS PubMed Google Scholar
Liu, H. & Zhang, J. Yeast spontaneous mutation rate and spectrum vary with environment. Curr. Biol. 29, 1584–1591 (2019).
Article CAS PubMed PubMed Central Google Scholar
Zhang, J. Rates of conservative and radical nonsynonymous nucleotide substitutions in mammalian nuclear genes. J. Mol. Evol. 50, 56–68 (2000).
Article ADS CAS PubMed Google Scholar
Storz, J. F. et al. The role of mutation bias in adaptive molecular evolution: insights from convergent changes in protein function. Philos. Trans. R. Soc. Lond. B Biol. Sci. 374, 20180238 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lynch, M. The lower bound to the evolution of mutation rates. Genome Biol. Evol. 3, 1107–1118 (2011).
Article PubMed PubMed Central Google Scholar
Loeillet, S. et al. Trajectory and uniqueness of mutational signatures in yeast mutators. Proc. Natl Acad. Sci. USA 117, 24947–24956 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Tran, H. T., Keen, J. D., Kricker, M., Resnick, M. A. & Gordenin, D. A. Hypermutability of homonucleotide runs in mismatch repair and DNA polymerase proofreading yeast mutants. Mol. Cell Biol. 17, 2859–2865 (1997).
Article CAS PubMed PubMed Central Google Scholar
Lynch, M. et al. Perspective: Spontaneous deleterious mutation. Evolution 53, 645–663 (1999).
Article PubMed Google Scholar
Shor, E., Fox, C. A. & Broach, J. R. The yeast environmental stress response regulates mutagenesis induced by proteotoxic stress. PLoS Genet. 9, e1003680 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lynch, M. & Hill, W. G. Phenotypic evolution by neutral mutation. Evolution 40, 915–935 (1986).
Article PubMed Google Scholar
Tattini, L. et al. Accurate tracking of the mutational landscape of diploid hybrid genomes. Mol. Biol. Evol. 36, 2861–2877 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lande, R. Natural selection and random genetic drift in phenotypic evolution. Evolution 30, 314–334 (1976).
Article PubMed Google Scholar
Chamary, J. V., Parmley, J. L. & Hurst, L. D. Hearing silence: non-neutral evolution at synonymous sites in mammals. Nat. Rev. Genet. 7, 98–108 (2006).
Article CAS PubMed Google Scholar
Huang, M. E., Rio, A. G., Nicolas, A. & Kolodner, R. D. A genomewide screen in Saccharomyces cerevisiae for genes that suppress the accumulation of mutations. Proc. Natl Acad. Sci. USA 100, 11529–11534 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Formosa, T. & Nittis, T. Suppressors of the temperature sensitivity of DNA polymerase alpha mutations in Saccharomyces cerevisiae. Mol. Gen. Genet. 257, 461–468 (1998).
Article CAS PubMed Google Scholar
Mitchell, S. F., Jain, S., She, M. & Parker, R. Global analysis of yeast mRNPs. Nat. Struct. Mol. Biol. 20, 127–133 (2013).
Article CAS PubMed Google Scholar
Rao, B. S. & Parker, R. Numerous interactions act redundantly to assemble a tunable size of P bodies in Saccharomyces cerevisiae. Proc. Natl Acad. Sci. USA 114, E9569–E9578 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Yin, Z. et al. Psp2, a novel regulator of autophagy that promotes autophagy-related protein translation. Cell Res. 29, 994–1008 (2019).
Article CAS PubMed PubMed Central Google Scholar
Qian, W., Ma, D., Xiao, C., Wang, Z. & Zhang, J. The genomic landscape and evolutionary resolution of antagonistic pleiotropy in yeast. Cell Rep. 2, 1399–1410 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zou, Z. & Zhang, J. Are nonsynonymous transversions generally more deleterious than nonsynonymous transitions? Mol. Biol. Evol. 38, 181–191 (2021).
Article CAS PubMed Google Scholar
Hershberg, R. & Petrov, D. Evidence that mutation is universally biased towards AT in bacteria. PLoS Genet. 6, e1001115 (2010).
Article PubMed PubMed Central Google Scholar
Wang, L. et al. Repeat-induced point mutation in Neurospora crassa causes the highest known mutation rate and mutational burden of any cellular life. Genome Biol. 21, 142 (2020).
Article CAS PubMed PubMed Central Google Scholar
Loh, E., Salk, J. J. & Loeb, L. A. Optimization of DNA polymerase mutation rates during bacterial evolution. Proc. Natl Acad. Sci. USA 107, 1154–1159 (2010).
Article ADS CAS PubMed Google Scholar
Raynes, Y., Wylie, C. S., Sniegowski, P. D. & Weinreich, D. M. Sign of selection on mutation rate modifiers depends on population size. Proc. Natl Acad. Sci. USA 115, 3422–3427 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Wiser, M. J., Ribeck, N. & Lenski, R. E. Long-term dynamics of adaptation in asexual populations. Science 342, 1364–1367 (2013).
Article ADS CAS PubMed Google Scholar
Furio, V., Moya, A. & Sanjuan, R. The cost of replication fidelity in an RNA virus. Proc. Natl Acad. Sci. USA 102, 10233–10237 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Fitzsimmons, W. J. et al. A speed-fidelity trade-off determines the mutation rate and virulence of an RNA virus. PLoS Biol. 16, e2006459 (2018).
Article PubMed PubMed Central Google Scholar
Wagner, G. P. & Zhang, J. The pleiotropic structure of the genotype-phenotype map: the evolvability of complex organisms. Nat. Rev. Genet. 12, 204–213 (2011).
Article CAS PubMed Google Scholar
Lindsay, S. J., Rahbari, R., Kaplanis, J., Keane, T. & Hurles, M. E. Similarities and differences in patterns of germline mutation between mice and humans. Nat. Commun. 10, 4053 (2019).
Article ADS PubMed PubMed Central Google Scholar
Laughery, M. F. et al. New vectors for simple and streamlined CRISPR-Cas9 genome editing in Saccharomyces cerevisiae. Yeast 32, 711–720 (2015).
Article CAS PubMed Google Scholar
Avila, V. et al. Increase of the spontaneous mutation rate in a long-term experiment with Drosophila melanogaster. Genetics 173, 267–277 (2006).
Article CAS PubMed PubMed Central Google Scholar
Saxena, A. S., Salomon, M. P., Matsuba, C., Yeh, S. D. & Baer, C. F. Evolution of the mutational process under relaxed selection in Caenorhabditis elegans. Mol. Biol. Evol. 36, 239–251 (2019).
Article CAS PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Lang, G. I. Measuring mutation rates using the Luria-Delbrück fluctuation assay. Methods Mol. Biol. 1672, 21–31 (2018). Springer.
Article CAS PubMed Google Scholar
Gillet-Markowska, A., Louvel, G. & Fischer, G. bz-rates: a web tool to estimate mutation rates from fluctuation analysis. G3 (Bethesda) 5, 2323–2327 (2015).
Article PubMed Google Scholar
Hamon, A. & Ycart, B. Statistics for the Luria-Delbrück distribution. Electron J. Stat. 6, 1251–1272 (2012).
Article MathSciNet MATH Google Scholar
Maclean, C. J. et al. Deciphering the genic basis of yeast fitness variation by simultaneous forward and reverse genetics. Mol. Biol. Evol. 34, 2486–2502 (2017).
Article CAS PubMed Google Scholar
Zhu, Y. O., Siegal, M. L., Hall, D. W. & Petrov, D. A. Precise estimates of mutation rate and spectrum in yeast. Proc. Natl Acad. Sci. USA 111, E2310–E2318 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Sharp, N. P., Sandell, L., James, C. G. & Otto, S. P. The genome-wide rate and spectrum of spontaneous mutations differ between haploid and diploid yeast. Proc. Natl Acad. Sci. USA 115, E5046–E5055 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Elyashiv, E. et al. Shifts in the intensity of purifying selection: an analysis of genome-wide polymorphism data from two closely related yeast species. Genome Res. 20, 1558–1573 (2010).
Article CAS PubMed PubMed Central Google Scholar
Bobay, L. M. & Ochman, H. Factors driving effective population size and pan-genome evolution in bacteria. BMC Evol. Biol. 18, 153 (2018).
Article CAS PubMed PubMed Central Google Scholar
Fawcett, J. A. et al. Population genomics of the fission yeast Schizosaccharomyces pombe. PLoS One 9, e104241 (2014).
Article ADS PubMed PubMed Central Google Scholar
Smith, D. R. & Lee, R. W. Nucleotide diversity in the mitochondrial and nuclear compartments of Chlamydomonas reinhardtii: investigating the origins of genome architecture. BMC Evol. Biol. 8, 156 (2008).
Article PubMed PubMed Central Google Scholar
Nordborg, M. et al. The pattern of polymorphism in Arabidopsis thaliana. PLoS Biol. 3, e196 (2005).
Article PubMed PubMed Central Google Scholar
Shapiro, J. A. et al. Adaptive genic evolution in the Drosophila genomes. Proc. Natl Acad. Sci. USA 104, 2271–2276 (2007).
Article ADS PubMed PubMed Central Google Scholar
Phifer-Rixey, M. et al. Adaptive evolution and effective population size in wild house mice. Mol. Biol. Evol. 29, 2949–2955 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011).
Article CAS PubMed PubMed Central Google Scholar
Nieuwenhuis, B. P. & James, T. Y. The frequency of sex in fungi. Philos. Trans. R. Soc. Lond. B Biol. Sci. 371, 20150540 (2016).
Article PubMed PubMed Central Google Scholar
Tang, C. et al. The evolution of selfing in Arabidopsis thaliana. Science 317, 1070–1072 (2007).
Article ADS CAS PubMed Google Scholar
Sasso, S., Stibor, H., Mittag, M. & Grossman, A. R. From molecular manipulation of domesticated Chlamydomonas reinhardtii to survival in nature. eLife 7, e39233 (2018).
Article PubMed PubMed Central Google Scholar
Meader, S., Ponting, C. P. & Lunter, G. Massive turnover of functional sequence in human and other mammalian genomes. Genome Res. 20, 1335–1343 (2010).
Article CAS PubMed PubMed Central Google Scholar
Rands, C. M., Meader, S., Ponting, C. P. & Lunter, G. 8.2% of the Human genome is constrained: variation in rates of turnover across functional element classes in the human lineage. PLoS Genet. 10, e1004525 (2014).
Article PubMed PubMed Central Google Scholar
Lee, H., Popodi, E., Tang, H. & Foster, P. L. Rate and molecular spectrum of spontaneous mutations in the bacterium Escherichia coli as determined by whole-genome sequencing. Proc. Natl Acad. Sci. USA 109, E2774–E2783 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Sung, W. et al. Asymmetric context-dependent mutation patterns revealed through mutation-accumulation experiments. Mol. Biol. Evol. 32, 1672–1683 (2015).
Article CAS PubMed PubMed Central Google Scholar
Farlow, A. et al. The spontaneous mutation rate in the fission yeast Schizosaccharomyces pombe. Genetics 201, 737–744 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ness, R. W., Morgan, A. D., Vasanthakrishnan, R. B., Colegrave, N. & Keightley, P. D. Extensive de novo mutation rate variation between individuals and across the genome of Chlamydomonas reinhardtii. Genome Res. 25, 1739–1749 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ossowski, S. et al. The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana. Science 327, 92–94 (2010).
Article ADS CAS PubMed Google Scholar
Keightley, P. D. et al. Analysis of the genome sequences of three Drosophila melanogaster spontaneous mutation accumulation lines. Genome Res. 19, 1195–1201 (2009).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Liangke Gou and Leonid Kruglyak for providing the raw replicate mutation rate estimates of seven natural strains of yeast and Alex Kondrashov, Wenfeng Qian, and members of the Zhang laboratory for valuable comments. This work was supported by the U.S. National Institutes of Health research grant R35GM139484 to J.Z.

Author information

Authors and Affiliations

Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
Haoxuan Liu & Jianzhi Zhang

Authors

Haoxuan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jianzhi Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.L. and J.Z. designed the study and wrote the paper. H.L. performed the research and analyzed the data.

Corresponding author

Correspondence to Jianzhi Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Description of Additional Supplementary Files

Supplementary Data 1

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, H., Zhang, J. The rate and molecular spectrum of mutation are selectively maintained in yeast. Nat Commun 12, 4044 (2021). https://doi.org/10.1038/s41467-021-24364-6

Download citation

Received: 28 April 2021
Accepted: 10 June 2021
Published: 30 June 2021
DOI: https://doi.org/10.1038/s41467-021-24364-6

This article is cited by

Rapid evolution of mutation rate and spectrum in response to environmental and population-genetic challenges
- Wen Wei
- Wei-Chin Ho
- Michael Lynch
Nature Communications (2022)
Efficacy of mutagenic treatment with gamma-rays, EMS and combinations in producing superior mutants in okra (Abelmoschus esculentus L.)
- Nivedita Gupta
- Sonia Sood
Vegetos (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.