An experimental test on the probability of extinction of new genetic variants

Chelo, Ivo M.; Nédli, Judit; Gordo, Isabel; Teotónio, Henrique

doi:10.1038/ncomms3417

Download PDF

Article
Open access
Published: 13 September 2013

An experimental test on the probability of extinction of new genetic variants

Ivo M. Chelo¹,
Judit Nédli¹,
Isabel Gordo¹ &
…
Henrique Teotónio¹

Nature Communications volume 4, Article number: 2417 (2013) Cite this article

6427 Accesses
18 Citations
55 Altmetric
Metrics details

Subjects

Population genetics

Abstract

In 1927, J.B.S. Haldane reasoned that the probability of fixation of new beneficial alleles is twice their fitness effect. This result, later generalized by M. Kimura, has since become the cornerstone of modern population genetics. There is no experimental test of Haldane’s insight that new beneficial alleles are lost with high probability. Here we demonstrate that extinction rates decrease with increasing initial numbers of beneficial alleles, as expected, by performing invasion experiments with inbred lines of the nematode Caenorhabditis elegans. We further show that the extinction rates of deleterious alleles are higher than those of beneficial alleles, also as expected. Interestingly, we also find that for these inbred lines, when at intermediate frequencies, the fate of invaders might not result in their ultimate fixation or loss but on their maintenance. Our study confirms the key results from classical population genetics and highlights that the nature of adaptation can be complex.

Experimental evolution of adaptive divergence under varying degrees of gene flow

Article 11 January 2021

Rapid evolution of mutation rate and spectrum in response to environmental and population-genetic challenges

Article Open access 13 August 2022

The population genomics of adaptive loss of function

Article Open access 11 February 2021

Introduction

When a new allele appears by mutation or recombination, it will be represented in very low numbers and it can be lost even when it is adaptive. This idea was first formalized by J.B.S. Haldane when he deduced that the probability of fixation of a new beneficial allele is twice its fitness effect¹. Haldane’s insight became the foundation of modern population genetics^2,3,4,5 and has since been used to understand the limits of adaptation^6,7,8,9, speciation rates^10,11,12 and the extent of differentiation between populations^13,14, including those of pathogens and their human hosts^15,16,17.

Haldane’s result only applies to beneficial alleles and it depends on the invader and resident alleles having independent growth dynamics that will compete in populations characterized by discrete non-overlapping generations and stable sizes^{1,18,19,20,21,22}. His approximation is further conditional on alleles having weak fitness effects and on individuals having successful offspring numbers that follow Poisson distributions. Together, these assumptions allow for genetic drift to be described as a branching process and for the probability of fixation to be calculated from selection coefficients, which in turn can be measured in finite populations as the relative frequency changes of invader and resident alleles²³.

The critical prediction made by Haldane is that, even with a substantial selection advantage, the probability of extinction declines with the number of individuals carrying the invading beneficial allele. There are no explicit experimental tests of this prediction. Results concerning the extinction of beneficial alleles are few^24,25, with the most relevant ones coming from recent experiments where it was shown that the probability of extinction declines with increasing strength of selection²⁴. In this latter study, however, extinctions due to genetic drift had to be modelled because the manipulation of the number of invader alleles was not possible.

Here we use highly replicated experiments in the hermaphroditic nematode Caenorhabditis elegans to introduce beneficial alleles at precise low numbers in a resident population composed of deleterious alleles to test Haldane’s prediction.

Results

The C. elegans model system

We started by deriving a collection of inbred lines through self-fertilization of hermaphrodites of a polymorphic population that had been adapted for 140 generations to laboratory conditions. These laboratory conditions involved maintaining stable census population sizes of 10⁴ individuals, and effective population sizes of 10³, under discrete non-overlapping generations^26,27. In preliminary trials, the derived inbred lines were characterized for embryo to adult survival, with the one showing the lowest values (here referred to as the wild-type line) being chosen as the potentially deleterious surrogate for the resident allele.

In parallel, we introgressed a green fluorescent protein (GFP) genetic construct into the lab-adapted population (see Methods), while maintaining genetic diversity, and then also enforced self-fertilization to derive another collection of inbred lines homozygous for GFP. One of these GFP lines (here referred to as the GFP line) was arbitrarily chosen as the potentially beneficial surrogate for the invader allele. Note that although henceforth we refer to the wild-type and the GFP inbred lines as two different ‘alleles’, these lines differ among them at many loci throughout the genome with each being composed of isogenic diploid individuals. Defining the invader and resident alleles in this manner prevents artifactual positive results due to the GFP introgression. Though GFP expression is fully penetrant and highly visible across larval and adult life-history stages, and is further known to be stably inserted into the genome²⁸, difficulties in scoring GFP individuals could lead to inadvertent estimation of low extinction rates (see Methods).

Haldane’s assumption that competing alleles have independent growth dynamics was easily achieved with the number of generations of selfing employed for the inbred line derivation, which makes them nearly isogenic because the average expected heterozygosity across the genome is <0.01% (refs 3, 27) (see Methods). In line with this, we genotyped the inbred lines at 29 single-nucleotide polymorphisms (SNPs) and found no heterozygosity in them (Supplementary Data 1). Outcrossing among the inbred lines was expected to be of minor consequence because C. elegans hermaphrodites can only cross-fertilize their oocytes when mated with males²⁹. Males were initially absent due to the derivation of the inbred lines by selfing, and neither mutation nor selection for outcrossing was expected to increase male frequency for the short duration of the experiments^26,30. Together, these observations imply that segregation and recombination in the two inbred lines would be negligible during the experiments. The inbred lines were thus expected to accurately model haploid, or allelic, growth dynamics that are independent of each other.

Beneficial alleles are lost by genetic drift

To test for a decrease in extinction rates with the number of beneficial invaders, we performed experiments with two starting frequencies of the GFP allele: either two or five GFP reproductively immature individuals (late L4 larval stage) were introduced into populations with 10³ wild-type individuals of the same stage. Multiple replicate invasions were seeded and independently propagated for five generations in the same discrete non-overlapping generations experienced by the lab-adapted population during the preceding 140 generations. As total offspring numbers before sampling can reach several tens of thousands (Supplementary Fig. S1), there will be great culling to maintain constant population sizes of N=10³, and thus the distributions of successful offspring numbers most likely follow Poisson distributions¹⁸.

The probability of extinction (P_ext) was measured as the proportion of replicate invasions that failed to show the GFP allele when scored in adult worms, before reproduction, for two consecutive generations (see Methods). Though extinction of invaders is expected to predominantly occur in the first few generations, genetic drift will nonetheless lead to further loss until tens to hundreds of generations, depending on the initial number of invaders and strength of selection (Supplementary Fig. S2). Note, however, that running the experiments for longer periods at the large population sizes employed would allow mutations to start accumulating^31,32.

Results from the two invasion experiments show, as expected, that the P_ext was higher when invader alleles started from lower frequencies (Fig. 1a,b). In particular, by generation 5, the GFP allele was lost from 37% of the replicates when it started the invasions from two individuals (n=35, bootstrap 95% confidence interval (CI)=23–51%), but only lost from 12.5% of the replicates when starting from five individuals (n=56, bootstrap 95% CI=5–21%). To confirm that these results were not biased by an insufficient number of replicates, we next sought to determine if selection on the GFP allele was congruent among the two invasion experiments.

**Figure 1: Extinction decreases with number of invading alleles.**

For this, we numerically simulated the expected change in GFP allele frequency during the invasion experiments as a function of its relative fitness, while accounting for the total number of individuals and the life cycle of the populations. Specifically, the relative growth rates of GFP and wild-type alleles were modelled as 1+s and 1, respectively, with s being defined as a selection coefficient³. We estimate selection not by an analytical approach, which determines the ultimate probability of fixation for weak or strong selection, but by exact Monte–Carlo simulations (see Methods).

Maximum likelihood (ML) analyses of the allele counts done during the last three generations of the experiments confirmed our initial suspicion that the GFP inbred line was fitter than the wild-type inbred line, regardless of initial numbers (Fig. 1c,d). The estimated selection coefficient for the GFP allele is consistent between the two experiments; as the estimates have overlapping confidence limits ranging from s=0.14 to s=0.20. Among experiments, the mean selection coefficient of the GFP allele was of 0.16. Together, the probability of extinction measured at generation 5 and the selection coefficients estimated demonstrate that genetic drift was responsible for the loss of beneficial alleles during the first generations after their invasion.

Deleterious alleles temporally evade genetic drift

In contrast to the fate of beneficial invader alleles, deleterious alleles can reach high frequencies by genetic drift when N_es<1 (refs 5, 20, 21, 22). Although in our case Ns>10, theory nevertheless predicts that the extinction rates of a rare deleterious allele should be higher than that of an invading beneficial allele of equivalent absolute fitness. We next tested this prediction by asking whether or not inverting the direction of the invasions would lead to higher P_ext, and if so the selection coefficient would be of similar magnitude but negative in sign.

For this second set of invasion experiments, five individuals from the wild-type inbred line were introduced into a population with 10³ GFP individuals. Results show that after five generations the wild-type allele had been lost in 56% of the replicates (Fig. 2a). This was a proportion, which, as expected, was much higher than that observed when the five GFP individuals invaded the wild-type population (from Fig. 1b, 12.5%). Furthermore, the median selection coefficient that best explained the P_ext of the (deleterious) wild-type allele was of −0.138 (Fig. 2a), which in magnitude, and also as expected, closely matched the selective coefficient of the GFP allele found in the first set of invasion experiments (from Fig. 1c,d; mean s=0.16).

**Figure 2: Probability of extinction and fixation.**

Having shown that genetic drift has an impact on the establishment of beneficial and deleterious alleles, we can ask what are their expected probabilities of fixation (P_fix). For this, we turned to M. Kimura’s diffusion approach that generalizes Haldane’s result to both beneficial and deleterious alleles^20,21: , where p is the initial frequency of the invader allele, and N_e is the effective size of the population. With N_e=10³, the expected P_fix of a beneficial allele would be 0.46 or 0.79 when invading a population with two or five individuals, respectively (Fig. 2b). If the invader allele is deleterious, fixation is impossible with just five individuals and N_e=10³. If during the invasion experiments sizes were of N_e=10² or lower, which reflects typical situations when the effective population sizes are smaller than the census population sizes^9,27, the P_fix of a deleterious allele would increase and likely become significant.

For the calculations of P_fix, the assumption of weak selection is violated because high selection coefficients were found. For beneficial alleles, when N_es>>1, Kimura’s expression shown above simplifies to , with n being the number of invaders, which predicts no dependence on population sizes (as also shown in Fig. 2b). Yet, even with strong positive selection, the P_fix is expected to be different when two or five individuals invade a resident population. Furthermore, it has been previously shown that, with not very high selection coefficients, relatively small discrepancies in P_fix between Kimura’s diffusion approach and simulations of genetic drift under several explicit demographic scenarios are expected^33,34. Kimura’s approach, even if originally devised for weak selection, should thus remain a good approximation of the ultimate P_fix of an invader allele, whether it is beneficial or deleterious, in the range of selection coefficients explored here.

Frequency-dependent selection might maintain polymorphism

In Haldane’s result, the probability of fixation of rare beneficial alleles is conditional only on their survival during the initial stages of the invasion, where extinction by genetic drift is common. It is possible, however, that with frequency dependence, successful invasions of beneficial alleles do not ultimately result in their fixation. Our final experiments were done to illustrate this possibility by competing of the same wild-type and GFP inbred lines, but now at starting intermediate frequencies. As the invasion experiments above, these intermediate-frequency experiments were run for multiple generations, and scoring of the GFP was done at the adult life stage. Setup was, however, done at the first larval stage (L1 stage) as it would have been extremely difficult to separately handpick thousands of individuals and have adequate replication. Again as in the invasion experiments, the demographic conditions of discrete non-overlapping generations at N=10³ were followed. This time, we used the deterministic expectations of GFP allele frequency change over generations to estimate selection^3,19 (see Methods).

Strikingly, when GFP and wild-type inbred lines were competed at approximately equal proportions, a significant decrease in GFP alleles was observed (Fig. 3a). At intermediate frequencies, the GFP allele was therefore found to be deleterious (mean s=−0.12), contrary to the beneficial fitness effect that we found when it invaded populations from low frequencies. During the first generation of this intermediate-frequency experiment, we also scored the GFP allele frequency at the L1 larval stage. When comparing the selection coefficients estimated from a single generation of competition at the L1 and the adult stage, no differences were detected (Fig. 3b). Independently of the protocol employed therefore, whether it is single or multi-generational or whether scoring is done at the L1 or at the adult stage, the GFP allele was consistently deleterious when at intermediate frequencies.

**Figure 3: Frequency-dependent selection.**

To further support the change in the sign of selection, we repeated the competitions but now starting from three different intermediate GFP proportions (see Methods): 0.15, 0.5 and 0.85. These experiments were run for a single generation, and the GFP allele frequency was measured at the L1 larval stage, again to have adequate replication. Results show that the GFP allele continued to be deleterious for these different intermediate frequencies (Fig. 3c, mean s between −0.18 and −0.40).

When the selection coefficients estimated from the latter experiments and from the invasion experiments are plotted together with their corresponding starting GFP frequencies, a non-linear function clearly emerges (Fig. 3c). Positive selection coefficients at the extreme frequencies employed in the invasion experiments demonstrate that individuals carrying the GFP allele can escape loss when rare or, alternatively, resist the invasion of the wild-type allele. When between 5 and 93%, however, selection will be against the GFP allele. In other words, there was frequency-dependent selection among the two inbred lines.

It is possible that underlying frequency-dependent selection are differences in the reproductive behaviour of the two inbred lines, as they seem to differ in embryo laying and embryo retention rates, as well as in successfully propagating themselves in our specific culture protocol (Supplementary Figs S1 and S3–S5). Interactions among fitness components are nevertheless complex, and only a comprehensive characterization of several life-history traits would allow a full understanding of the possible mechanisms underlying the observed frequency dependence, an endeavour that is beyond the scope of the present study.

We finish by showing that the successful invasion of beneficial alleles in a population might not necessarily lead to their ultimate fixation. From the fitness function of Fig. 3c, it is apparent that there are two equilibrium frequencies but only one of them is stable, at around 5%. Taking as an example the invasion experiments that were seeded with five GFP individuals, frequency-dependent selection or frequency-independent selection would initially have similar consequences to the extinction of beneficial invaders because they would be relatively rare (Fig. 4a). After 100 generations or so, however, and with frequency-independent selection, most of the populations where extinction did not occur would be fixed for the beneficial invader allele. In contrast, with frequency-dependent selection, polymorphism would be maintained at the stable equilibrium frequency. One of the expected outcomes of evolution under frequency-dependent selection is thus the maintenance of polymorphism for longer periods than under frequency-independent selection (Fig. 4b,c). If, however, the stable equilibrium frequency is of sufficiently low value, genetic drift can lead to higher extinction rates than those expected with frequency-independent selection (Fig. 4c, dashed lines). Higher extinction of new beneficial alleles is therefore expected to be important for populations with small sizes, as in them genetic drift is more pronounced.

**Figure 4: Selection may maintain diversity and increase extinction.**

Discussion

We have shown that even beneficial alleles with strong fitness effects can be lost by genetic drift, if they have not reached a high enough frequency for their dynamics to become deterministic^1,18,19. Deleterious alleles on the other hand can escape genetic drift and be maintained at relatively high frequencies, particularly in small populations, at least until selection efficiently purges them^20,21,22. Together, the invasion experiments clearly confirm classical theory in population genetics^2,3,4,5.

As also shown, however, once beneficial alleles are established in a population, they might not speed to fixation, but be maintained, because at intermediate frequencies their adaptive values can change in sign. Though the generality of frequency-dependent selection remains to be shown, our observations from the intermediate-frequency experiments in these strains might be relevant for interpreting variation in natural populations that are genetically structured by variation in ploidy levels^35,36, assortative mating and inbreeding among relatives^14,37, population subdivision^38,39 or repeated hybridization^40,41. In all of them, frequency-dependent selection might be common, which together with genetic drift can hinder adaptation to novel environments.

In conclusion, classical theory in population genetics is confirmed, but natural selection might not be of invariable magnitude and sign. Our findings thus set the stage for the development of more general theoretical models explaining the fate of new alleles across long evolutionary timescales^22,42,43,44.

Methods

Inbred lines

The wild-type inbred line (EEV1401) was derived by 12 consecutive generations of self-fertilization of hermaphrodites coming from a polymorphic lab-adapted population (A6₁₄₀). Twenty wild-type inbred lines were derived by the expansion of the last generation of self-fertilization to high sample sizes (>10³) during two generations. M9 solutions containing arrested L1-staged individuals were mixed in equal proportions to a solution of 1 M NaCl, 1 M KH₂PO₄, glycerine and 1 M MgSO₄, with stocks being then frozen at −80^oC (ref. 45). The lab-adapted population had been maintained for 140 generations under discrete non-overlapping generations at constant population census sizes of N=10⁴ (refs 26, 27). As determined at SNPs covering 1/3 of the genome, the effective population sizes were of N_e=10³ until generation 100 of experimental evolution^26,27.

To derive an inbred line that would allow easy scoring during the experiments reported here, the genome-integrated transgenic array ccls4251(myo-3::GFP)²⁸, from strain PD4251, was first introgressed into A6₁₄₀. The transgenic array is stable and expressed in muscle cells of all larval and adult stages²⁸. Outcrossing of PD4251 males with A6₁₄₀ hermaphrodites was followed by three consecutive generations of self-fertilization to obtain separate GFP F2 families. As the GFP is dominant, only F2 individuals coming from families where all relatives expressed it were kept for the next introgression cycle. Five of these introgression cycles were done, each starting with the mating of GFP homozygous F2 hermaphrodites to an excess of males from A6₁₄₀. Five to ten families were employed at each introgression cycle to ensure maintenance of diversity. Once this A6₁₄₀ GFP population was constructed, 10 consecutive generations of self-fertilization in hermaphrodites were done to derive three inbred GFP lines. One of these was arbitrarily chosen for the experiments (EEV1402).

Inbred line genetic diversity

The extent of heterozygosity within each inbred line was determined by genotyping 29 SNPs in pools of 20–30 individuals (Supplementary Data 1). These SNPs are located in chromosome IV and are not under high linkage disequilibrium in the A6₁₄₀ population (mean r²=0.03 (ref. 27)). Genomic DNA form was prepared with the ZyGEM prepGEM Insect kit following the manufacturer’s protocol. Genotypes were mass determined with allele-specific extension reactions on oligonucleotides generated from PCR-amplified genomic DNA using the iPlex Sequenom MALDI-TOF platform^27,46. Expected heterozygosities upon inbreeding were obtained by: H_t=1−[(1/2)^t H₀] (ref. 3), with H₀=0.24 being the average heterozygosity of A6₁₄₀ measured at 334 SNPs, H the Hardy–Weinberg proportions and t the number of generations of self-fertilization.

Culturing of populations

Inbred lines were revived for each set of experiments from −80 ^oC stocks and expanded for two generations under a common environment before setup. All replicates were separately passaged under discrete 4-day non-overlapping life cycles at constant sizes of 10³ individuals. Developmental synchronization occurred at the first larval stage (L1), after which seeding was done to 9 cm petri plates with NGM-lite agar (US Biological), carrying a fully confluent lawn of Escherichia coli (HT115 strain). After 72±2 h of growth under constant temperature (20° C) and relative humidity (80%), worms were removed from the plates, subjected to a 1 M KOH: 5% NaOCl solution for 5min and repeatedly washed in M9 buffer. Only the embryos that were laid in the plates and transferred, together with those that burst out of the hermaphrodite body, survived this procedure (Supplementary Fig. S3). After 24±2 h, L1s were collected to seed the following generation. Densities were estimated by scoring the number of L1s in five 5 μl samples (Supplementary Fig. S6).

Invasion experiments

For the experiments where the GFP line was the invader, 35 petri dishes were set up, each with two immature late L4 individuals (L4s; 48±2 h after L1 seeding), whereas 56 petri dishes were set up with five individuals, in a resident population of 10³ wild-type individuals of similar developmental stage. All these were concurrently passaged for five generations. Setup was thus done before reproduction (G0). At G1 and G2, plates were only scored for presence/absence of GFP individuals, and from G3–G5 all individuals were scored. Loss of GFP was only considered if it could be confirmed in two consecutive generations.

For the experiments where the wild-type line was the invader, 50 petri dishes were set up with five individuals. These were done at a different time and thus involved another revival of the lines from the −80 ^oC stocks. In these experiments, only a sample of 300 individuals were scored from G1 to G5. GFP loss was declared if the wild-type individuals failed to be detected for three consecutive generations. The probability of extinction measured at G5 had thus to include observations from extra G6 and G7.

Invaders were scored at the adult stage before passage with a stereoscope equipped with a mercury lamp and GFP filters at × 15 magnification. For the exact counts of GFP individuals, see Supplementary Tables S1 and S2.

Selection in invasion experiments

In the experiments with GFP individuals as invaders, selection coefficients (s) were obtained by ML using Monte–Carlo numerical simulations. Each simulation started with the initial invader frequencies at fixed 2 × 10⁻³ or 5 × 10⁻³ that were sampled following:

where t is generation^3,19. Equation (1) thus formulates the expected frequency change of a GFP allele with growth rate 1+s relative to a wild-type allele of growth rate 1. A random number of GFP alleles were sampled for four generations, following a binomial distribution with p_t+1 as the probability of success and 10³ as the sample size. 10⁸ replicate simulations were done for each invasion experiment and for each of 101 selection coefficients defined in a grid of points between 0 and 1. At G3, G4 and G5, the empirical likelihood estimates were obtained by comparison of simulated and observed GFP individual counts. In the range where likelihood values were higher than the ML minus 10 log-likelihoods, 9 × 10⁸ extra replicate simulations were performed. A quadratic function was then fitted to these intervals to obtain the ML estimate minus 2 log-likelihoods for CIs. In this last step, experimental data that were not observed in at least 50 simulations, for any given s, were removed to reduce noise.

In the experiments with wild-type individuals as invaders, the interval of s values compatible with the number of observed line losses at G5 was obtained in a similar manner. Replicate simulations of 10³ were done for selection coefficients defined in a grid of points between −0.4 and 0.1. One extra step in the algorithm relative to that above was introduced after experimental passage to reflect the sampling of 300 individuals for scoring. As for the experimental data, extinction was declared when the invader was not observed for three consecutive generations until G7. The proportion of replicates where extinctions occurred by G5 were recorded as the median value and the interquantile range, because the simulated data were not normal. Linear interpolation was used to find the best estimate of s and its error range. All computations were done using R⁴⁷.

Intermediate-frequency experiments

Inbred lines were revived from −80 ^oC stocks and expanded for two generations under common environmental conditions. On the third generation, L1-staged GFP individuals were mixed with L1-staged wild-type individuals at a proportion of 0.5. The GFP frequencies observed in the adults of that generation constituted the starting frequencies for the estimation of selection coefficients in the adult stage. Eleven replicate competitions were then passaged for four generations in the same conditions employed for the invasion experiments. The mean number of adults scored per replicate and generation was of 767 and not less than 414. Estimates of GFP frequency changes were also obtained from L1-stage individuals for the set-up frequencies and after one generation (see below) to compare estimates between L1 and adult stages.

For the characterization of frequency dependence, GFP and wild-type L1s were mixed at three different starting proportions: 0.15, 0.50 and 0.85. These intermediate-frequency experiments were conducted at a different time and thus involved yet another revival of the inbred lines from the −80 ^oC stocks. For each starting frequency, 10 replicate competitions were done by seeding 10³ individuals. GFP scoring was done in the following generation at the L1 larval stage, by placing 5 μl of the M9 buffer containing L1s on glass slide and photographing them at 2 pixel μm⁻¹ resolution under a microscope equipped with a GFP filter. The mean number of L1s scored per replicate competition was of 1,159 and not less than 762. Allele frequency estimates were similarly obtained for the starting mixes to estimate sampling error (Supplementary Fig. S6).

Selection in the intermediate-frequency experiments

For the multiple generation experiment where adults were scored, selection coefficients were obtained by solving equation (1) shown above, given the observed GFP allele frequencies. A mean selection coefficient (ŝ) was obtained from the intercept of a mixed model, where replicate competitions where used as a random factor. The calculation of different selection estimates separately at the several generations indicated congruency among them, and hence time was not included in the model. The resulting selection coefficient is thus the mean s obtained from the different replicate competitions. Note that the most common estimation method for s, using the log ratio of allele frequencies that changes with time^2,48, would give estimates close to those reported in the main text (s=−0.16±0.03 s.e.m. with the log ratio method, compared with s=−0.12±0.03 s.e.m. by solving equation (1)). As equation (1) was used in the Monte–Carlo simulations for estimation of selection in the invasion experiments, we continued to use it for the estimation of selection in the intermediate-frequency experiments.

Simulations of frequency-dependent selection

Monte–Carlo simulations followed the same algorithm as in the invasion simulations with starting GFP proportions of 0.005. At each generation, s was evaluated (from equation (1)) according to the fitness function of Fig. 3c, to model frequency dependence, or by fixing s when at P=0.005 in the same fitness function, to model frequency independence. One thousand replicates in each selection scenario were obtained. P_ext and P_fix were calculated from the number of replicates where the invader allele was lost or fixed, respectively.

Additional information

How to cite this article: Chelo, I. M. et al. An experimental test on the probability of extinction of new genetic variants. Nat. Commun. 4:2417 doi: 10.1038/ncomms3417 (2013).

References

Haldane, J. A mathematical theory of natural and artificial selection, part V: selection and mutation. Math. Proc. Cambridge Phil. Soc. 23, 838–844 (1927).
Article ADS Google Scholar
Fisher, R. The Genetical Theory of Natural Selection Oxford University Press (1930).
Crow, J. F. & Kimura, M. An Introduction to Population Genetics Theory Harper & Row, Publishers (1970).
Wright, S. Evolution and the Genetics of Populations: Variability within and among Natural Populations Vol. 4, (University of Chicago Press (1978).
Kimura, M. The Neutral Theory of Molecular Evolution Cambridge University Press (1983).
Lenski, R. E., Rose, M. R., Simpson, S. C. & Tadler, S. C. Long-term experimental evolution in Escherichia coli 1. Adaptation and divergence during 2,000 generations. Am. Nat. 138, 1315–1341 (1991).
Article Google Scholar
Perfeito, L., Fernandes, L., Mota, C. & Gordo, I. Adaptive mutations in bacteria: high rate and small effects. Science 317, 813–815 (2007).
Article CAS ADS Google Scholar
Rice, W. R. & Chippindale, A. K. Sexual recombination and the power of natural selection. Science 294, 555–559 (2001).
Article CAS ADS Google Scholar
Teotonio, H., Chelo, I. M., Bradic, M., Rose, M. R. & Long, A. D. Experimental evolution reveals natural selection on standing genetic variation. Nat. Genet. 41, 251–257 (2009).
Article CAS Google Scholar
Chimpanzee, S. & Analysis, C. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437, 69–87 (2005).
Article Google Scholar
Navarro, A. & Barton, N. H. Chromosomal speciation and molecular divergence-accelerated evolution in rearranged chromosomes. Science 300, 321–324 (2003).
Article CAS ADS Google Scholar
Seehausen, O. et al. Speciation through sensory drive in cichlid fish. Nature 455, 620–626 (2008).
Article CAS ADS Google Scholar
Linnen, C. R., Kingsley, E. P., Jensen, J. D. & Hoekstra, H. E. On the origin and spread of an adaptive allele in deer mice. Science 325, 1095–1098 (2009).
Article CAS ADS Google Scholar
Andersen, E. C. et al. Chromosome-scale selective sweeps shape Caenorhabditis elegans genomic diversity. Nat. Genet. 44, 285–290 (2012).
Article CAS Google Scholar
Bustamante, C. D. et al. Natural selection on protein-coding genes in the human genome. Nature 437, 1153–1157 (2005).
Article CAS ADS Google Scholar
Voight, B. F., Kudaravalli, S., Wen, X. & Pritchard, J. K. A map of recent positive selection in the human genome. PLoS Biol. 4, e72 (2006).
Article Google Scholar
Volkman, S. K. et al. A genome-wide map of diversity in Plasmodium falciparum. Nat. Genet. 39, 113–119 (2007).
Article CAS Google Scholar
Fisher, R. On the dominance ratio. Proc. Royal Soc. Edinburgh 42, 321–341 (1922).
Article Google Scholar
Wright, S. Evolution in Mendelian populations. Genetics 16, 97–159 (1931).
CAS PubMed PubMed Central Google Scholar
Kimura, M. Some problems of stochastic processes in genetics. Ann. Math. Stat. 28, 882–901 (1957).
Article MathSciNet Google Scholar
Kimura, M. On the probability of fixation of mutant genes in a population. Genetics 47, 713–719 (1962).
CAS PubMed PubMed Central Google Scholar
Waxman, D. A unified treatment of the probability of fixation when population size and the strength of selection change over time. Genetics 188, 907–913 (2011).
Article CAS Google Scholar
Patwa, Z. & Wahl, L. M. The fixation probability of beneficial mutations. J. R. Soc. Interface 5, 1279–1289 (2008).
Article CAS Google Scholar
Gifford, D. R., de Visser, J. A. & Wahl, L. M. Model and test in a fungus of the probability that beneficial mutations survive drift. Biol. Lett. 9, 20120310 (2012).
Article Google Scholar
Berenos, C., Wegner, K. M. & Schmid-Hempel, P. Antagonistic coevolution with parasites maintains host genetic diversity: an experimental test. Proc. Biol. Sci. 278, 218–224 (2011).
Article Google Scholar
Teotonio, H., Carvalho, S., Manoel, D., Roque, M. & Chelo, I. M. Evolution of outcrossing in experimental populations of Caenorhabditis elegans. PLoS One 7, (2012).
Chelo, I. M. & Teotonio, H. The opportunity for balancing selection in experimental populations of Caenorhabditis elegans. Evolution 67, 142–156 (2012).
Article Google Scholar
Fire, A. et al. Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature 391, 806–811 (1998).
Article CAS ADS Google Scholar
Maupas, E. Modes et formes de reproduction des nematodes. Arch. Exp. Gen. Ser. 3, 463–624 (1900).
Google Scholar
Teotónio, H., Manoel, D. & Phillips, P. C. Genetic variation for outcrossing among Caenorhabditis elegans isolates. Evolution 60, 1300–1305 (2006).
Article Google Scholar
Denver, D. R. et al. Selective sweeps and parallel mutation in the adaptive recovery from deleterious mutation in Caenorhabditis elegans. Genome Res. 20, 1663–1671 (2010).
Article CAS Google Scholar
Estes, S., Phillips, P. C., Denver, D. R., Thomas, W. K. & Lynch, M. Mutation accumulation in populations of varying size: the distribution of mutational effects for fitness correlates in Caenorhabditis elegans. Genetics 166, 1269–1279 (2004).
Article CAS Google Scholar
Barrett, R. D., M'Gonigle, L. K. & Otto, S. P. The distribution of beneficial mutant effects under strong selection. Genetics 174, 2071–2079 (2006).
Article CAS Google Scholar
Heffernan, J. M. & Wahl, L. M. The effects of genetic drift in experimental evolution. Theor. Popul. Biol. 62, 349–356 (2002).
Article Google Scholar
Gerstein, A. C. & Otto, S. P. Cryptic fitness advantage: diploids invade haploid populations despite lacking any apparent advantage as measured by standard fitness assays. PLoS One 6, e26599 (2011).
Article CAS ADS Google Scholar
Sellis, D., Callahan, B. J., Petrov, D. A. & Messer, P. W. Heterozygote advantage as a natural consequence of adaptation in diploids. Proc. Natl Acad. Sci. USA 108, 20666–20671 (2011).
Article CAS ADS Google Scholar
Horton, M. W. et al. Genome-wide patterns of genetic variation in worldwide Arabidopsis thaliana accessions from the RegMap panel. Nat. Genet. 44, 212–216 (2012).
Article CAS Google Scholar
Hanski, I. & Saccheri, I. Molecular-level variation affects population growth in a butterfly metapopulation. PLoS Biol. 4, e129 (2006).
Article Google Scholar
Cutter, A. D., Wang, G. X., Ai, H. & Peng, Y. Influence of finite-sites mutation, population subdivision and sampling schemes on patterns of nucleotide polymorphism for species with molecular hyperdiversity. Mol. Ecol. 21, 1345–1359 (2012).
Article CAS Google Scholar
Labbe, P., Sidos, N., Raymond, M. & Lenormand, T. Resistance gene replacement in the mosquito Culex pipiens: fitness estimation from long-term cline series. Genetics 182, 303–312 (2009).
Article CAS Google Scholar
Caicedo, A. L., Stinchcombe, J. R., Olsen, K. M., Schmitt, J. & Purugganan, M. D. Epistatic interaction between Arabidopsis FRI and FLC flowering time genes generates a latitudinal cline in a life history trait. Proc. Natl Acad. Sci. USA 101, 15670–15675 (2004).
Article CAS ADS Google Scholar
Wootton, J. T. Field parameterization and experimental test of the neutral theory of biodiversity. Nature 433, 309–312 (2005).
Article CAS ADS Google Scholar
Uecker, H. & Hermisson, J. On the fixation process of a beneficial mutation in a variable environment. Genetics 188, 915–930 (2011).
Article Google Scholar
Huang, W. & Traulsen, A. Fixation probabilities of random mutants under frequency dependent selection. J. Theor. Biol. 263, 262–268 (2010).
Article MathSciNet Google Scholar
Stiernagle, T. Maintenance of C. elegans Oxford University Press (1999).
Bradic, M., Costa, J. & Chelo, I. M. inMolecular Methods for Evolutionary Genetics Vol. 772, (eds Orgogozo, V. & Rockman, M.) (Humana Press (2011).
R Development Core Team. R: A language and environment for statistical computing http://www.R-project.org (2006).
Chevin, L. M. On measuring selection in experimental evolution. Biol. Lett. 7, 210–213 (2011).
Article Google Scholar

Download references

Acknowledgements

We thank B. Afonso, S. Carvalho, C. Goy, P. Sandner and A. Silva for technical support, and T. Dean, D. Gresham, L. Perfeito, S. Proulx and L. Wahl for discussion. We also thank the reviewers for suggestions that greatly improved the presentation of this work. Funding was assured by grants from the Human Frontiers Science Program (RGP0045/2010) and the European Research Council (stERC/2009-243285) to H.T.

Author information

Authors and Affiliations

Instituto Gulbenkian de Ciência, Apartado 14, Oeiras, P-2781-901, Portugal
Ivo M. Chelo, Judit Nédli, Isabel Gordo & Henrique Teotónio

Authors

Ivo M. Chelo
View author publications
You can also search for this author in PubMed Google Scholar
Judit Nédli
View author publications
You can also search for this author in PubMed Google Scholar
Isabel Gordo
View author publications
You can also search for this author in PubMed Google Scholar
Henrique Teotónio
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

I.M.C., I.G. and H.T. designed the project. I.M.C. and J.N. conducted the experiments. I.M.C. analysed the data. I.M.C. and H.T. wrote the manuscript.

Corresponding author

Correspondence to Henrique Teotónio.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures S1-S6 and Supplementary Tables S1-S2 (PDF 575 kb)

Supplementary Data 1 (XLSX 13 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/

Reprints and permissions

About this article

Cite this article

Chelo, I., Nédli, J., Gordo, I. et al. An experimental test on the probability of extinction of new genetic variants. Nat Commun 4, 2417 (2013). https://doi.org/10.1038/ncomms3417

Download citation

Received: 01 November 2012
Accepted: 08 August 2013
Published: 13 September 2013
DOI: https://doi.org/10.1038/ncomms3417

This article is cited by

Reproductive assurance drives transitions to self-fertilization in experimental Caenorhabditis elegans
- Ioannis Theologidis
- Ivo M Chelo
- Henrique Teotónio
BMC Biology (2014)
Experimental determination of invasive fitness in Caenorhabditis elegans
- Ivo M Chelo
Nature Protocols (2014)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.