The genic capture hypothesis, where sexually selected traits capture genetic variation in condition and the condition reflects genome-wide mutation load, stands to explain the presence of abundant genetic variation underlying sexually selected traits. Here we test this hypothesis by applying bidirectional selection to male mating success for 14 generations in replicate populations of Drosophila melanogaster. We then resequenced the genomes of flies from each population. Consistent with the central predictions of the genic capture hypothesis, we show that genetic variance decreased with success selection and increased with failure selection, providing evidence for purifying sexual selection. This pattern was distributed across the genome and no consistent molecular pathways were associated with divergence, consistent with condition being the target of selection. Together, our results provide molecular evidence suggesting that strong sexual selection erodes genetic variation, and that genome-wide mutation-selection balance contributes to its maintenance.
The maintenance of female preferences for the ornaments of males is a long-standing evolutionary puzzle; why are females persistently choosy when their choice should erode genetic variation among males and preclude any benefits of choice (the lek paradox1,2,3)? However, despite being under strong, directional selection4, the amount of genetic variation underlying sexually selected traits is unexpectedly high5. The genic capture hypothesis proposes that high levels of genetic variation can be explained by sexual traits capturing genetic variation in condition6. Phenotypic condition is dependent on resource acquisition and allocation7,8, which, in turn, is thought to be dependent on traits at many loci throughout the genome9. Hence, condition could provide a large target for mutations10 and, as most new mutations are mildly deleterious11, genetic variation could be maintained in mutation-selection balance across the genome. The persistence of female choice also stands explained, as the offspring of choosy females should inherit a smaller burden of deleterious mutations. Empirical evidence demonstrates that sexually selected traits are condition dependent, that condition dependence has a genetic basis12,13,14, and that sexual selection removes deleterious mutations15,16,17,18. However, molecular evidence for these processes remains largely elusive.
Evolve and resequence19, where the genomes of experimental populations are sequenced following artificial selection or experimental evolution19,20,21,22,23,24, offers a powerful approach for testing evolutionary hypotheses. Applying bidirectional, artificial selection to a focal trait should cause bidirectional changes in allele frequencies at loci under selection. Under the genic capture hypothesis, unconditionally deleterious (and, therefore, rare) alleles that reduce condition are largely expected to contribute to quantitative trait variation. In lines selected for lower fitness, the frequency of rare, deleterious alleles should increase, causing an increase in genetic variance25, a simple prediction that arises from single-locus models (e.g., see Figure 8.1 in ref. 26). The opposite should be true for selection for increased fitness. Alternative explanations for the maintenance of genetic variation largely rely on some form of fluctuating or balancing selection (e.g., overdominance27 or intralocus sexual conflict28), where alleles under selection should be at more intermediate frequencies and/or high or low frequencies with variable effects on fitness (see ref. 29). Although rare alleles may predominantly contribute to trait variation under such alternative models, their effects are expected to be more varied; some rare alleles may increase trait values, whereas other rare alleles decrease trait values. Thus, selection should have more variable effects on genetic variance across loci and the overall genetic variance may even increase in both directions of selection25.
In a previous experiment, we showed that bidirectional, artificial selection on male mating success generated a strong evolutionary response; lines selected for males that won mate choice trials (success-selected) had higher mating success than lines selected for losers (failure-selected)18. The response to selection was correlated with changes in inbreeding depression (in egg-to-adult viability) and sperm competitiveness, potentially indicating that condition, which underlies these traits, may have been the target of sexual selection. Moreover, traits linked to mating success in D. melanogaster (e.g., body size19 and courtship song24) are highly polygenic. Thus, our biometric data, together with previous molecular data, align with the genic capture hypothesis.
In this study, we resequence the genomes of our experimentally diverged lines to specifically test the predictions outlined above. We show that success-selected lines have significantly less genetic variation than failure-selected lines, particularly at loci associated with divergence. In addition, we find no evidence that specific sets of genes are associated with divergence in mating success, indicating that condition could be the target of selection. Thus, our results demonstrate that sexual selection erodes molecular genetic variation and suggest that this genetic variation is maintained in mutation-selection balance across the genome, thereby supporting the genic capture hypothesis.
To assess the molecular signature of sexual selection in our system, we resequenced the genomes of flies from each of 11 experimental lines (4 success-selected, 4 failure-selected, and 3 control). These lines were generated by applying bidirectional artificial selection to male mating success. Briefly, we selected on mating success by exposing males to mate choice trials, and using males that mounted females to propagate four success-selected lines (N = 25 males and 25 females) and males that failed to mount females to propagate four failure-selected lines (N = 25 males and 25 females). Four control lines were maintained with flies not exposed to the choice trials (only 3 were used in this study; N = 25 males and 25 females). Following 14 generations of selection, we collected 70 females from each line and stored them in ethanol before analysis. We used females so that sample size of the X chromosome was the same as the autosomes. Although we may have missed important variation on the Y chromosome, the primary focus of this study was on the genome-wide effects of selection.
Tissue from the 70 pooled females was homogenised, DNA extracted, and the whole genome sequenced. Pooled sequencing (pool-seq30) yields allele counts for the common and rare alleles at each locus, from which we calculated allele frequencies and expected heterozygosity (He; 2pq). After quality control and screening out loci where the frequency of rare allele was < 0.05 across all 11 lines (within any line, the frequency could be < 0.05, but had to be ≥ 0.05 overall), we identified 55,277 polymorphisms. The allele frequency spectra for all lines can be seen in Supplementary Figure 1a. We then aimed to identify polymorphisms that were associated with divergence (candidate loci) and used these to compare patterns of He and Gene Ontology (GO).
Identifying candidate loci
We applied two approaches for identifying candidate loci. For our first approach, we calculated the DiffStat score at each locus as the minimum allele frequency difference between any combination of success-selected vs. failure-selected lines19. The DiffStat is zero unless all four lines of one selection regimen have a higher or a lower allele frequency than all four lines from the other selection regimen. We then calculated the maximum allele frequency difference between any combination of lines within a selection regimen (i.e., variation caused by drift). Candidate loci were those where the DiffStat was greater than drift. Below, we discuss Diffstat loci (i.e., where Diffstat > 0), non-Diffstat loci (Diffstat = 0), and significant DiffStat loci (Diffstat > drift). For our second approach, we analysed the effect of sexual selection regimen (success-selected vs. failure-selected) on allele frequencies (i.e., the counts of the common and rare allele) at each locus using quasibinomial generalised linear models (GLMs), adjusting for multiple testing using q-values with a false discovery rate (FDR) of 0.0531,32. We identified 102 significant DiffStat loci, 88 significant loci from the GLM approach, and 68 loci significant in both approaches. To be conservative, the following analyses were performed using the 68 significant loci identified from both approaches, which we henceforth refer to as significantly diverged variants (SDVs).
We first evaluated the He of success- and failure-selected lines at SDVs. If purifying sexual selection is acting, then the He should be lower in success-selected lines for SDVs. Only 3 of the 68 SDVs (4.4%) had higher He in success-selected lines. This pattern was consistently strong across all DiffStat loci (i.e., including non-SDV loci), where the He was higher among success-selected lines for only 272 out of 1363 (20.0%) DiffStat loci (Fig. 1; Supplementary Figure 1b). It is noteworthy that variation in the density of SDVs and DiffStat loci could be caused by variation in the strength of selection, but could also easily be explained by, e.g., variation in recombination rate along the chromosome arms, which is known to correlate with nucleotide diversity33. Below, we refer to the proportion of DiffStat loci that have less variation in the success-selected lines (i.e., 0.800 here) as PS < F. These results not only show that success-selected lines have less genetic variation than failure-selected lines, suggesting a role for mutation-selection balance in maintaining genetic variation, but suggest that this effect is widespread, supporting the genic capture hypothesis.
The above analysis does not account for linkage or the density of polymorphisms. Therefore, we compared He between selection regimens within 50 kb sliding windows with 10 kb step sizes. We calculated the mean He in each sliding window for each experimental line. Our first interest was in identifying sliding windows that had more SDVs than expected by chance (enriched windows), as these represent areas of the genome most likely to be under selection. Of the 12,424 sliding windows across the five chromosome arms, we identified 265 windows with at least 1 SDV and 57 windows that were enriched for SDVs (significant windows). Given the number of sliding windows and the number of significant windows, we then used a resampling approach to test whether enriched windows were evenly distributed across the five chromosome arms. Significant windows were underrepresented on chromosome 2R, but were otherwise distributed across the other four chromosome arms.
We then compared He between selection regimens both genome-wide and within significant windows (see Supplementary Tables 1–2 for He statistics for each chromosome arm). Across all sliding windows, He was significantly lower in success-selected lines compared with failure-selected lines (Fig. 2a), as determined by a simple t-test of the eight line medians (t = 2.49; df = 6; P = 0.047). This pattern was particularly strong in enriched windows (t = 9.41; df = 6; P = 0.0001; Fig. 2b). The He of control lines was relatively intermediate for all windows and for significant windows (Fig. 2), suggesting that sexual selection increased variation in failure-selected lines and depleted variation in success-selected lines. Together, these results support the main premise of the lek paradox that sexual selection erodes genetic variation and suggest a role for mutation-selection balance in maintaining genetic variation. Moreover, success-selected lines had lower genome-wide (across all sliding windows) gene diversity, consistent with the large mutation target hypothesised under the genic capture hypothesis (evident in Fig. 1), where rare, deleterious mutations from across the genome are implicated in the maintenance of genetic variation6,34.
Starting allele frequencies
Next, we tested whether alleles under selection were likely to have been rare in the base population, as hypothesised if genetic variation is maintained in mutation-selection balance. We used the mean of the three control populations as our estimate of the starting allele frequency at each locus. We then compared the median starting allele frequency of the SDVs to the genome-wide background. To do so, we randomly resampled (with replacement) 10,000 sets of 68 starting allele frequencies from across the genome and calculated the median allele frequency for each sample, generating a distribution of 10,000 starting allele frequencies. The observed median starting allele frequency of the SDVs was 0.077, significantly lower than the median starting allele frequency, 0.113 (0.085–0.141, 95% confidence intervals) generated by random sampling (i.e., the genome-wide background). This result indicates that SDVs tended to be at low frequency in the starting population, in line with mutation-selection balance.
We then evaluated whether SDVs were involved with certain molecular pathways. Under the genic capture hypothesis, most of the genome should be involved with the expression of sexually selected traits and, therefore, there is no a priori reason to expect that specific molecular pathways will be involved with divergence in mating success. To assess this, we explored the GO of the SDVs using GOWINDA35. We identified 763 GO terms associated with the SDVs; however, none of these GO terms were significant (i.e., no terms were associated with the SDVs more than expected by chance alone). Although this result aligns with the genic capture hypothesis in suggesting that a broad array of molecular pathways (condition) were associated with divergence, it is also possible that the lack of significant GO terms is a simple consequence of having relatively few SDVs to test.
Genetic drift simulation
Finally, we performed a series of simulations to test whether our observed results could be reproduced through genetic drift alone. Specifically, we tested whether a comparable reduction in genetic variation could arise if the effective population size (Ne) of success-selected lines was smaller than the Ne of the failure-selected lines. Our observed FST estimate for success-selected lines was slightly higher than for failure-selected lines (0.182 and 0.162, respectively), indicating that Ne may have been smaller in success-selected lines. We simulated neutral evolution for 14 generations with a range of Ne ratios and compared measures of genetic variation (He and PS < F) and within-regimen genetic divergence (FST). The results from these simulations indicated that differences in Ne between selection regimens could generate similar patterns of He and PS < F, but not without substantially inflating FST estimates (see Methods and Supplementary Discussion for a full discussion). Hence, it is highly unlikely that genetic drift alone caused the reduction in He in success-selected lines.
Our results provide molecular evidence in support of key evolutionary hypotheses. First, decreased He in success-selected lines supports the main premise of the lek paradox that female choice erodes genetic variation. However, we previously demonstrated that deleterious recessive mutations affecting mating success were still segregating in success-selected populations18, so the biological significance of the extent to which genetic variation has actually been depleted by selection remains unclear. Second, the pattern of He that we identified aligns with mutation-selection balance, where the alleles contributing to variation in male mating success in the base populations were rare and consistent in their effects. Our molecular evidence for mutation-selection balance supports our previous finding that success-selected lines were purged of recessive mutations affecting viability, whereas failure-selected lines were not18. Third, our results are consistent with the genic capture hypothesis. We found that the median He across all sliding windows was lower in the success-selected lines, and that He was consistently lower at DiffStat loci, at least across the two autosomes. In addition, we found no significant GO terms, demonstrating that no specific molecular pathways have been involved with divergence and suggesting that mutations to any genes can affect mating success through deleterious effects on condition. Together, our results provide comprehensive biometric18 and molecular evidence in support of a genic capture resolution to the lek paradox.
We found no evidence that intermediate frequency alleles were important to quantitative trait variation underlying mating success, which might have been expected under alternative models for the maintenance of genetic variation. For example, sexual conflict36,37,38 is well documented among drosophilids39,40,41,42,43 and intralocus sexual conflict44 should maintain alleles at relatively intermediate frequencies through balancing selection. Similarly, molecular evidence has demonstrated that trade-offs between natural and sexual selection can maintain genetic variation through balancing selection27. These kinds of antagonistic effects may be prominent in long-standing laboratory populations that are near their adaptive peak45 and exploring how genetic architecture changes with laboratory adaptation warrants further investigation. Here, using a recently established laboratory population, we found compelling evidence for widespread, negative selection against rare alleles, supporting our previous finding of no trade-offs18. More tests such as these are required; we still know very little about the extent to which rare alleles in mutation-selection balance contribute to quantitative trait variation.
The most parsimonious alternative explanation for the pattern of purging that we identified was a smaller Ne of success-selected lines. Indeed, despite equalising the number of breeding individuals across all lines (N = 50) following the selection screen, our FST estimates indicated that the Ne of success-selected lines may still have been slightly lower than the Ne of the failure-selected lines. Although our simulations demonstrate that the smaller Ne could not account for the extent of purging that we identified, it would be worthwhile testing the interacting effects of sexual selection and reduced Ne on molecular (in conjunction with quantitative) genetic variation, as sexual selection does directly reduce Ne46,47. In particular, evaluating whether or not adaptive variation is eroded by sexual selection in this context would be a valuable avenue to explore. For example, in conservation programs, it might be necessary to weigh potential benefits of sexual selection (purging deleterious mutations15,16,17,18,48) with the associated cost (loss of genetic variation). In other words, to what extent does sexual selection purge unconditionally deleterious variation and to what extent does it also purge variation with adaptive potential?
Our results highlight the promise of using evolve-and-resequence to test specific hypotheses in sexual selection theory. However, there remain important limitations. First, populations were small and, therefore, the relative effects of genetic drift, as well as the effects of linkage disequilibrium (LD), are enhanced. As a result of both factors, identifying polymorphisms linked to divergence can be compromised. Drift overriding selection will serve to limit our ability to accurately detect SDVs. LD, on the hand, can cause a large number of false positives in evolve and resequence studies23,49, particularly in small populations when there is strong selection on rare alleles. Our sliding window analysis accounts for short to medium range linkage (10 kb) to some extent, but any effects of long-range LD have not been accounted for and can be substantial23. Second, our primary focus was on purifying selection against deleterious (and, therefore, rare) mutations. An initial step in finding loci for analyses was the removal of loci where the frequency of the rare allele was < 0.05, potentially the most important loci. As a result, our evidence for purifying selection is likely to be conservative. Third, males in failure-selected lines had to mate with a female, in order to generate the next generation. Therefore, selection on mating success was weakened, potentially reducing our ability to detect loci strongly associated with variation mating success. Fourth, females used in choice trials were collected from the same experimental line as the competing males. If, e.g., female preferences differ among lines within a regimen, then selection might be acting on different loci and/or have divergent effects across lines. However, in the final mating success assay, males from each line were competed against standard males for access to standard females (both from the base population). Success-selected males had higher mating success than failure-selected males, so to a large degree the preferences of females in the selection lines aligned with the base population females. In addition, variation in preferences among lines would only serve to increase variance between replicates with regimens. Finally, we used a relatively fresh laboratory stock population in our study. To some extent, experimental lines would still be adapting to the laboratory. Theoretical50 and empirical evidence51 suggests that sexual selection can promote adaptation. Therefore, some of the effects that we see may be a consequence of this process. Although this does not undermine our results in any way, it would be interesting to investigate whether the molecular effects of sexual selection differ when populations are near to, or far from, their adaptive peak.
Taken together, our results are molecular evidence for the erosion of genetic variation that sets up the lek paradox and for the genic capture that resolves it. More generally, these results contribute to our understanding of the evolutionary persistence of genetic variation in the face of strong, directional selection.
Overview and selection
The stock population of D. melanogaster used in this experiment was established by collecting flies en masse from Innisfail, Queensland, in 201218. Stock flies were maintained in cage culture at 25 °C with overlapping generations and a population size of thousands. Artificial selection lines were established approximately ten generations later, after a brief period of laboratory adaptation. We then applied bidirectional selection to male mating success for 14 generations (there were seven generations of selection, three generations of relaxed selection, and a further seven generations of selection) by exposing 7–8-day-old, non-virgin males to binomial mate choice trials and selecting either males that only ever won in mating trials (success-selected; four replicates) or males that only ever lost in mating trials (failure-selected; four replicates). Non-virgin females that were used in mate choice trials were collected from the same line as the males for a given choice trial. Four control lines were maintained with males not exposed to the mate choice trails (only three are represented in this study as one was lost before sequencing). For each line, 25 males (successful/unsuccessful/control) and 25 virgin females were used to establish each generation, making the population size ~50 flies. However, in establishing each generation, we had five replicate vials of five males and five females, which were allowed to mate freely and lay eggs over 24 h. Therefore, within a vial (and indeed between vials), there was the opportunity for selection to act (i.e., selection after the selection assay), both on males and females.
After one generation of relaxed selection, we collected 70 females from each selection line and 3 control lines, and stored them in 100% ethanol at −20 °C before DNA extraction. We used females to keep the sample size of the X chromosome and the autosomes the same. Although we may have missed important variation on the Y chromosome, our main test was for mutation-selection balance across the genome. DNA extraction, library construction (Illumina gDNA shotgun library preparation), and genome sequencing were all performed by the Australian Genome Research Facility. Briefly, DNA was extracted from the homogenised tissue of 70 flies (for each line) using NucleoBond Buffer Set IV (Macherey-Nagel). One hundred base pair paired-end sequences were constructed and sequenced using an Illumina HiSeq2500 platform. Image analysis was performed in real time by the HiSeq Control Software v2.2.68 and Real Time Analysis v188.8.131.52. Sequence data were generated using the Illumina bcl2fastq 184.108.40.206 pipeline. Raw fastq sequences were mapped to the D. melanogaster reference genome downloaded from FlyBase (flybase.org; r6.08) using BWA version 0.7.12-r103954. Sam files were converted to bam files using samtools55. Variant calling was performed using GATK56, with a UnifiedGenotyper algorithm and ploidy settings equal to 1. The total number of paired-end reads ranged from 21.5 million to 23.2 million across the 11 samples (Supplementary Table 3). Variants different to the reference genome were identified as polymorphisms, which included all insertion/deletion/single nucleotide polymorphisms. Of the 1.4 million polymorphic loci, only ~10% were represented in all 11 populations; all others were discarded. We further excluded reads with mapping qualities < 15 and where the coverage was < 5 or > 250 for any population. For any given population, ~96% of remaining loci had coverage ranging from 15 to 45, with a mean of ~30 (Supplementary Table 3). Finally, we excluded loci where the global frequency of the minor allele was < 5% (i.e., the frequency of the minor allele across all 11 populations), as very rare variants may represent sequencing errors and would not be informative for analyses24. This makes the analyses conservative given that the main focus of the study is rare variants. Any other errors are expected to be randomly distributed across the 11 samples. It is noteworthy that each locus did not have to be polymorphic in each population, but there had to be at least five reads from each population for a locus to be included in analyses. After screening, we had 55,277 insertion/deletion/single nucleotide polymorphisms (47,103 single-nucleotide polymorphisms).
Identifying candidate loci
We applied two approaches for identifying candidate loci (i.e., variants that may have been under sexual selection). For our first approach, we evaluated whether divergence between selection lines from different regimens exceeded divergence between lines within regimes. We first calculated the frequency of the rare allele at each locus for each population. We then identified loci where the frequency of the minor allele was consistently higher or lower in success-selected lines compared with failure-selected lines. At these loci, we recorded the minimum allele frequency difference between any combination of success-selected vs. failure-selected line, the DiffStat for that locus19. The DiffStat is zero unless all four lines of one selection regimen have a higher or a lower allele frequency than all four lines from the other selection regimen. We identified 1363 DiffStat loci (i.e., loci where DiffStat > 0). We then calculated the maximum allele frequency difference between any combination of populations within a selection regimen (i.e., variation caused by drift). Candidate loci were those where the DiffStat exceeded drift for that locus.
For our second approach, we analysed the effect of selection regimen on the read counts of the common and rare alleles with quasibinomial GLMs, generating P-values at each locus with analysis of variances with default F tests. We adjusted for multiple testing using q-values with a FDR of 0.0531,32. Values of P < 9.50e−5 were statistically significant. In total, we identified 102 candidates using the first approach and 88 candidates using the second approach (68 were identified by both and these were used in the analyses).
As well as being performed on success vs. failure selection, GLMs were also performed independently for each selection regimen against the control lines to test whether allele frequencies at any loci had significantly diverged from the controls in opposite directions. We identified only one locus (which is not surprising given that population sizes were small and 11 populations were used) where the control lines were significantly divergent from both selection regimens in opposite directions (FBgn0083975; neuroligin 4).
Hypergeometric test for overrepresentation of SDVs
We performed a sliding window analysis of He (2pq; where p and q are the frequencies of the two alleles) to account for differences in the density of loci across the genome. We performed the analysis with 50 kb sliding windows with 10 kb step sizes. We then tested for overrepresentation of SDVs in 50 kb sliding windows21. Based on the number of variants and SDVs per chromosome arm, we calculated the probability of identifying s SDVs from n variants in each window. As with the GLMs, we corrected for genome-wide multiple testing using q-values. Given the very low number of SDVs, any windows with two or more SDVs were significantly enriched (i.e., had more SDVs than expected by chance). Overall, we identified 265 windows with at least one SDV and 57 of these had two or more.
We then tested whether or not the 57 significant windows were over- or underrepresented on any of the five major chromosome arms. To do so, we resampled 10,000 sets of 57 windows from across the five major arms and recorded the arm on which the windows were represented. We then compared the location of our 57 significant windows to the null distribution generated from the resampling. Significant windows were underrepresented on chromosome 2R (i.e., fewer windows than the 2.5% confidence interval of six). For the other four chromosome arms, the number of significant windows fell within the 95% confidence intervals.
Expected heterozygosity (H e) in sliding windows
We calculated the median He across all 50 kb sliding windows for each chromosome arm and then for the whole genome21 (Supplementary Tables 1–2). We then repeated this for the windows with > 1 SDV (i.e., significant windows).
We explored the GO of the significant variants to identify the function of gene sets associated with the divergence using GOWINDA35. We downloaded GO terms from FuncAssociate3.0 and an annotated genome from FlyBase. We ran the enrichment analysis on the SDVs and identified 763 GO terms. GOWINDA performs permutations tests (100,000 simulations) to calculate the FDR-corrected significance of overrepresentation of GO terms by randomly sampling sets of polymorphisms from the data and recording the genes associated with each polymorphism35. The observed GO terms of the SDVs are then compared with this null distribution to identify FDR-corrected significance levels. We identified no significant GO terms.
Simulating genetic drift
To test whether our observed patterns for DiffStats and He could be explained by genetic drift alone, we simulated neutral evolution for 14 generations while manipulating the Ne of the selection lines.
For each simulation, we simulated data for four success-selected lines and four failure-selected lines. For one set of simulations, we held the Ne of the failure-selected lines (Nfailure) constant at 50 and manipulated Nsuccess from 20 to 50. For a second set of simulations, we held Nfailure = 33 (~2/3 of the census population size57) and Nsuccess was 20–33. Our experimental lines comprised 25 males and 25 females; however, we did not control for variation in reproductive success, which may reduce Ne. A comparisons of these simulations allowed us to evaluate the effects of different overall Ne on the parameters calculated below.
As loci do not all segregate independently (linked loci will be in LD), we did not simulate data for all 55,277 loci. Instead, we first estimated the extent of LD among linked loci and used this to approximate the number of independently segregating loci in our study. To do so, we assessed the decay (i.e., the increase) of P-values surrounding SDVs. We calculated the mean P-values of loci within 50 bp non-overlapping sliding windows and concluded that LD had broken down when the curve peaked (Supplementary Figure 3). For all chromosome arms, LD breaks down within 10 kbps. Therefore, the number of loci (nloci) that we used in our simulations was determined by effectively sampling one locus every 10 kbps, a total of 12,424 loci. To be conservative, we also ran the simulations on nloci = 2640, sampling one locus every 50 kbps.
We did not resequence flies from the base population and, therefore, had no estimate of starting allele frequencies at each locus. As above, we used the mean of the three control lines as our estimate starting allele frequency in the base. For each locus in the simulations, we sampled, with replacement, from these starting allele frequencies. The mean frequency of the rare allele in the control lines was ~0.12.
For each selection line, we then sampled 2N (where N is the population size) alleles with values of 0 or 1 for each locus/line, with the probability of sampling allele 1 equal to the frequency of that allele each generation. To simulate genetic drift, alleles were sampled, with replacement, for 14 generations. In our experiment, there were an additional four generations of genetic drift among our lines when selection was relaxed; however, population sizes during these generations were large (in the order of several hundred) and genetic drift should have been negligible. Thus, we did not include these four generations in our simulations. After generation 14, we sampled k alleles at each locus, where k was identified by sampling (with replacement) a random value from a vector of coverages. Elements of this vector were calculated by taking the average (across all 11 populations) number of reads for each locus from our own data. This final step reflects the random nature of sequencing. We only included loci where the minor allele frequency was ≥ 0.05 and we continued to simulate data until the desired nloci was reached.
Using the final allele frequencies, we then calculated between line FST values for each regimen (using the BEDASSLE R package58), the median He across all loci for each line, and the DiffStats along with their direction (i.e., whether the success- or failure-selected lines had lower He at that DiffStat locus). We then recorded three statistics: (i) the difference between FST values between regimens (DFST), (ii) the proportion of diffstat scores where the success-selected lines had less variation (PS < F), and (iii) the difference between the mean He of the failure-selected lines and the mean He of the success-selected lines (DHe). We also recorded the numbers of DiffStat and significant DiffStat (calculated as above) loci to evaluate how many we expect through drift alone. We repeated this simulation 1000 times for each nloci and for each Nsuccess, and recorded the mean and 95% confidence intervals for each statistic. All three statistics will be functions of Nsuccess; we use their relationships to each other to indicate whether genetic drift could cause our results.
The data that support the findings of this study are available in figshare with the identifier doi:10.6084/m9.figshare.7692365. The source data underlying Fig. 2 are provided as a Source Data file.
Borgia, G. in Sexual Selection and Reproductive Competition in Insects (eds M. S. Blum & N. A. Blum) 19–80 (Academic, 1979).
Kirkpatrick, M. & Ryan, M. J. The evolution of mating preferences and the paradox of the lek. Nature 350, 33–38 (1991).
Taylor, P. D. & Williams, G. C. The lek paradox is not resolved. Theor. Popul. Biol. 22, 392–409 (1982).
Kingsolver, J. G. et al. The strength of phenotypic selection in natural populations. Am. Nat. 157, 245–261 (2001).
Pomiankowski, A. & Møller, A. A resolution of the lek paradox. Proc. R. Soc. Lond. B Biol. Sci. 260, 21–29 (1995).
Rowe, L. & Houle, D. The lek paradox and the capture of genetic variance by condition dependent traits. Proc. R. Soc. B Biol. Sci. 263, 1415–1421 (1996).
Andersson, M. Sexual selection, natural selection and quality advertisement. Biol. J. Linn. Soc. 17, 375–393 (1982).
Andersson, M. Evolution of condition-dependent sex ornaments and mating preferences: sexual selection based on viability differences. Evolution 40, 804–816 (1986).
Houle, D. Comparing evolvability and variability of quantitative traits. Genetics 130, 195–204 (1992).
Houle, D. How should we explain variation in the genetic variance of traits? Genetica 102, 241–253 (1998).
Lynch, M. & Walsh, B. Genetics and Analysis of Quantitative Traits. (Sinauer Associates, Sunderland, 1998).
David, P., Bjorksten, T., Fowler, K. & Pomiankowski, A. Condition-dependent signalling of genetic variation in stalk-eyed flies. Nature 406, 186 (2000).
Kotiaho, J. S., Simmons, L. W. & Tomkins, J. L. Towards a resolution of the lek paradox. Nature 410, 684–686 (2001).
Wilkinson, G. S. & Taper, M. Evolution of genetic variation for condition-dependent traits in stalk-eyed flies. Proc. R. Soc. Lond. B Biol. Sci. 266, 1685–1690 (1999).
Almbro, M. & Simmons, L. W. Sexual selection can remove an experimentally induced mutation load. Evolution 68, 295–300 (2014).
Lumley, A. J. et al. Sexual selection protects against extinction. Nature 522, 470–473 (2015).
Radwan, J. Effectiveness of sexual selection in removing mutations induced with ionizing radiation. Ecol. Lett. 7, 1149–1154 (2004).
Dugand, R. J., Kennington, W. J. & Tomkins, J. L. Evolutionary divergence in competitive mating success through female mating bias for good genes. Sci. Adv. 4, eaaq0369 (2018).
Turner, T. L., Stewart, A. D., Fields, A. T., Rice, W. R. & Tarone, A. M. Population-based resequencing of experimentally evolved populations reveals the genetic basis of body size variation in Drosophila melanogaster. PLoS. Genet. 7, e1001336 (2011).
Chenoweth, S. F., Appleton, N. C., Allen, S. L. & Rundle, H. D. Genomic evidence that sexual selection impedes adaptation to a novel environment. Curr. Biol. 25, 1860–1866 (2015).
Jha, A. R. et al. Whole-genome resequencing of experimental populations reveals polygenic basis of egg-size variation in Drosophila melanogaster. Mol. Biol. Evol. 32, 2616–2632 (2015).
Orozco-terWengel, P. et al. Adaptation of Drosophila to a novel laboratory environment reveals temporally heterogeneous trajectories of selected alleles. Mol. Ecol. 21, 4931–4941 (2012).
Tobler, R. et al. Massive habitat-specific genomic response in D. melanogaster populations during experimental evolution in hot and cold environments. Mol. Biol. Evol. 31, 364–375 (2013).
Turner, T. L. & Miller, P. M. Investigating natural variation in Drosophila courtship song by the evolve and resequence approach. Genetics 191, 633–642 (2012).
Kelly, J. K. Testing the rare-alleles model of quantitative variation by artificial selection. Genetica 132, 187–198 (2008).
Falconer, D. & Mackay, T. Introduction to Quantitative Genetics. (Longman Scientific & Technical, Essex, 1996).
Johnston, S. E. et al. Life history trade-offs at a single locus maintain sexually selected genetic variation. Nature 502, 93–95 (2013).
Tarka, M., Åkesson, M., Hasselquist, D. & Hansson, B. Intralocus sexual conflict over wing length in a wild migratory bird. Am. Nat. 183, 62–73 (2013).
Josephs, E. B., Stinchcombe, J. R. & Wright, S. I. What can genome‐wide association studies tell us about the evolutionary forces maintaining genetic variation for quantitative traits? New Phytol. 214, 21–33 (2017).
Futschik, A. & Schlötterer, C. The next generation of molecular markers from massively parallel sequencing of pooled DNA samples. Genetics 186, 207–218 (2010).
Dabney, A., Storey, J. D. & Warnes, G. qvalue: Q-value estimation for false discovery rate control. R package version (2013).
Storey, J. D. & Tibshirani, R. Statistical significance for genomewide studies. Proc. Natl Acad. Sci. USA 100, 9440–9445 (2003).
Nachman, M. W. Single nucleotide polymorphisms and recombination rate in humans. Trends Genet. 17, 481–485 (2001).
Tomkins, J. L., Radwan, J., Kotiaho, J. S. & Tregenza, T. Genic capture and resolving the lek paradox. Trends Ecol. Evol. 19, 323–328 (2004).
Kofler, R. & Schlötterer, C. Gowinda: unbiased analysis of gene set enrichment for genome-wide association studies. Bioinformatics 28, 2084–2085 (2012).
Arnqvist, G. & Rowe, L. Sexual Conflict (Princeton Univ. Press, 2005).
Chapman, T., Arnqvist, G., Bangham, J. & Rowe, L. Sexual conflict. Trends Ecol. Evol. 18, 41–47 (2003).
Parker, G. in Sexual Selection and Reproductive Competition in Insects (eds M. S. Blum & N. A. Blum) 123–166 (Academic, 1979).
Abbott, J. K., Bedhomme, S. & Chippindale, A. K. Sexual conflict in wing size and shape in Drosophila melanogaster. J. Evol. Biol. 23, 1989–1997 (2010).
Chippindale, A. K., Gibson, J. R. & Rice, W. R. Negative genetic correlation for adult fitness between sexes reveals ontogenetic conflict in Drosophila. Proc. Natl Acad. Sci. USA 98, 1671–1675 (2001).
Holland, B. & Rice, W. R. Experimental removal of sexual selection reverses intersexual antagonistic coevolution and removes a reproductive load. Proc. Natl Acad. Sci. USA 96, 5083–5088 (1999).
Pischedda, A. & Chippindale, A. K. Intralocus sexual conflict diminishes the benefits of sexual selection. PLoS Biol. 4, 2099–2103 (2006).
Stewart, A. D., Morrow, E. H. & Rice, W. R. Assessing putative interlocus sexual conflict in Drosophila melanogaster using experimental evolution. Proc. R. Soc. Lond. B Biol. Sci. 272, 2029–2035 (2005).
Bonduriansky, R. & Chenoweth, S. F. Intralocus sexual conflict. Trends Ecol. Evol. 24, 280–288 (2009).
Long, T. A. F., Agrawal, A. F. & Rowe, L. The effect of sexual selection on offspring fitness depends on the nature of genetic variation. Curr. Biol. 22, 204–208 (2012).
Charlesworth, B. Effective population size and patterns of molecular evolution and variation. Nat. Rev. Genet. 10, 195 (2009).
Nunney, L. The influence of mating system and overlapping generations on effective population size. Evolution 47, 1329–1341 (1993).
Jarzebowska, M. & Radwan, J. Sexual selection counteracts extinction of small populations of the bulb mites. Evolution 64, 1283–1289 (2010).
Kofler, R. & Schlötterer, C. A guide for the design of evolve and resequencing studies. Mol. Biol. Evol. 31, 474–483 (2014).
Lorch, P. D., Proulx, S., Rowe, L. & Day, T. Condition-dependent sexual selection can accelerate adaptation. Evolut. Ecol. Res. 5, 867–881 (2003).
Fricke, C. & Arnqvist, G. Rapid adaptation to a novel host in a seed beetle (Callosobruchus maculatus): the role of sexual selection. Evolution 61, 440–454 (2007).
R: A language and environment for statistical computing (R Foundation for Statistical Computing, Vienna, Austria, 2010).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis. (Springer-Verlag, New York, 2016).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Buri, P. Gene frequency in small populations of mutant Drosophila. Evolution 10, 367–402 (1956).
Bradburd, G. & Bradburd, M. G. Package ‘BEDASSLE’. Comprehensive R Archive Network (2013).
We thank Jacob Berson and the Centre for Evolutionary Biology for constructive discussion. This work was funded by the School of Biological Sciences (The University of Western Australia) and an Australian Research Council Future Fellowship to J.L.T. (FT110100500). R.J.D. was supported by an Australian Postgraduate Award Scholarship through The University of Western Australia.
The authors declare no competing interests.
Journal peer review information: Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Dugand, R.J., Tomkins, J.L. & Kennington, W.J. Molecular evidence supports a genic capture resolution of the lek paradox. Nat Commun 10, 1359 (2019). https://doi.org/10.1038/s41467-019-09371-y