Quantifying separation and similarity in a Saccharomyces cerevisiae metapopulation

Knight, Sarah; Goddard, Matthew R

doi:10.1038/ismej.2014.132


Original Article
Open access
Published: 25 July 2014


The ISME Journal volume 9, pages 361–370 (2015)


Abstract

Eukaryotic microbes are key ecosystem drivers; however, we have little theory and few data elucidating the processes influencing their observed population patterns. Here we provide an in-depth quantitative analysis of population separation and similarity in the yeast Saccharomyces cerevisiae with the aim of providing a more detailed account of the population processes occurring in microbes. Over 10 000 individual isolates were collected from native plants, vineyards and spontaneous ferments of fruit from six major regions spanning 1000 km across New Zealand. From these, hundreds of S. cerevisiae genotypes were obtained, and using a suite of analytical methods we provide comprehensive quantitative estimates for both population structure and rates of gene flow or migration. No genetic differentiation was detected within geographic regions, even between populations inhabiting native forests and vineyards. We do, however, reveal a picture of national population structure at scales above ∼100 km with distinctive populations in the more remote Nelson and Central Otago regions primarily contributing to this. In addition, differential degrees of connectivity between regional populations are observed and correlate with the movement of fruit by the New Zealand wine industry. This suggests some anthropogenic influence on these observed population patterns.


Introduction

Research to date attempting to elucidate the patterns and processes involved in shaping natural populations has largely focused on readily observed macroorganisms but comparatively little work has been conducted on microbial species (Anderson and Kohn, 1998; Martiny et al., 2006; Prosser et al., 2007). Because of their large population sizes, and ease of transfer, one might expect microbial populations to be well mixed (Finlay, 2002); however, there is increasing evidence showing that many are not homogeneous but structured (Taylor et al., 2006; Whitaker and Banfield, 2006; Hanson et al., 2012). Most microbial ecology studies have focused on bacteria, but eukaryotic microbes, which undergo sex (with recombination), also have key ecosystem roles (Green et al., 2008; Van Der Heijden et al., 2008). It is not clear whether the population patterns estimated for eukaryotic ‘macrobes’ generally hold for eukaryotic microbes (Hartl and Clark, 1997; Anderson and Kohn, 1998; Halkett et al., 2005; Taylor et al., 2006; Prosser et al., 2007; Tsai et al., 2008).

A metapopulation comprises a number of spatially separated populations of the same species that interact to some extent. To date all studies examining microbial populations have simply examined whether population structure is evident or not (Aa et al., 2006; Achtman, 2008; Liti et al., 2009; Goddard et al., 2010; Anderson and Shearer, 2011; Härnström et al., 2011; Gayevskiy and Goddard, 2012; Wang et al., 2012). Merely defining microbial populations as either structured or homogeneous is highly unlikely to reflect the true biological situation. A more accurate approach is to assess not only the degree to which populations are structured but also the degree to which they are connected by gene flow, and crucially to go on to quantify these processes; however, to the best of our knowledge, no previous studies have used a unified framework to do this. Here we analyse the natural population of Saccharomyces cerevisiae in New Zealand (NZ), and in doing so take the first steps towards quantifying microbial population structure and similarity.

S. cerevisiae, a budding yeast, has been closely associated with humans since the dawn of civilisation because of its fermentative capabilities, and has come to be of significant commercial importance in the production of bread, wine, beer and other alcoholic beverages (McGovern et al., 1996; Pretorius, 2000; Cavalieri et al., 2003; Chambers and Pretorius, 2010). S. cerevisiae is also a classic model organism for research into cell biology, genetics and increasingly ecology and evolution (Chambers and Pretorius, 2010; Dujon, 2010; Gray and Goddard, 2012; Hittinger, 2013; Hyma and Fay, 2013). Recent studies have revealed a large genetic diversity within S. cerevisiae, and there is good evidence for population structure at intercontinental scales (Fay and Benavides, 2005; Schuller et al., 2005; Aa et al., 2006; Lopandic et al., 2008; Liti et al., 2009; Goddard et al., 2010; Mercado et al., 2011; Di Maio et al., 2012; Wang et al., 2012). Similar inferences have been made at finer scales with reports that some genotypes were unique to different geographic locations in Austria, although many were also ubiquitous across regions (Lopandic et al., 2008). In addition, Bayesian inference shows genetic differentiation between populations spanning hundreds of kilometres in NZ (Gayevskiy and Goddard, 2012). While the scales of these studies differ, they all commonly report the presence of hybrid or mosaic strains indicative of some levels of connectivity between populations via gene flow.

Global-scale analyses have suggested that ecological function may define population structure to a greater extent than geographic origin (Fay and Benavides, 2005; Legras et al., 2007). Strains associated with wine appear somewhat distinct from those isolated from distilling, bread making, fermented milk, rice wine, ale and lager, with geographic origin only explaining 28% of variability (Legras et al., 2007). Furthermore, whole-genome analyses of a limited number of strains suggest specific S. cerevisiae populations associated with vineyards, sake and related ferments, although some of these clusters are confounded with geographic origin (Liti et al., 2009; Schacherer et al., 2009). In contrast, a recent rigorous genome-wide population study has provided evidence of gene flow across small distances (<17 km) between distinct populations inhabiting vineyards and oak trees, showing connectivity between ecological niches at small scales (Hyma and Fay, 2013).

Despite these excellent efforts to date, most studies have been drawn from widely dispersed isolates, often from different ecological niches, with relatively small sample sizes from any one discrete population (Fay and Benavides, 2005; Aa et al., 2006; Legras et al., 2007; Liti et al., 2009; Schacherer et al., 2009; Wang et al., 2012), and have thus not afforded adequate power to quantify ‘ecological scale’ population processes such as gene flow. Previous work using both microsatellite and RAD-seq analyses shows that a distinct S. cerevisiae population resides in NZ, suggesting that this population is not subject to rampant inward international gene flow (Goddard et al., 2010; Cromie et al., 2013). Therefore, in addition to its geographic isolation, the NZ S. cerevisiae population appears relatively self-contained and thus provides a good population in which to study the processes we are interested in. Here we analyse close to a thousand S. cerevisiae isolates from four niches across six regions spanning over 1000 km. We quantify both the degree to which this population is structured and the extent to which the various regional populations are connected by gene flow, in one of the most comprehensive studies of a microbial metapopulation to date. Lastly, this study sheds light on the connection between farmed (managed) and native ecosystems by examining the relationship between microbial populations residing in vineyards and native NZ forest.

Materials and methods

Sample collection and processing

Six to seven Vitis vinifera var. Sauvignon Blanc vineyards were selected from each of Hawke’s Bay, Martinborough, Nelson, Awatere Valley, Wairau Valley and Central Otago in NZ (Figure 1). Approximately 5 g of soil was aseptically taken from each of these 37 vineyards between 1 and 4 weeks before harvest in mid-March 2011. Ten litres of juice derived from the same vineyards were collected from commercial settling tanks (one vineyard provided juice samples from two pressing tiers, resulting in a total of 38 juice samples). Soil and fruit samples were taken from six native NZ plants located in non-managed native bush reserves within each region (Supplementary Table S2), ranging from 0.1 to 50 km from the vineyard sites, totalling 72 native samples (36 soil and 36 fruit). S. cerevisiae is rare in niches other than actively fermenting fruit, so equivalent selective culturing methods were employed for all samples to control for the effects of high sugar and ethanol (Mortimer and Polsinelli, 1999; Pretorius, 2000; Xufre et al., 2006; Goddard, 2008; Taylor et al., 2014). An enrichment method emulating fermenting selection pressures was employed for all 109 environmental samples (37 vineyard soil and 72 native samples) (Mortimer and Polsinelli, 1999; Serjeant et al., 2008). Samples were submerged in 10 ml SelMed media (1% yeast extract, 2% peptone, 10% glucose and 5% ethanol) for six days; 500 μl was then transferred to 10 ml fresh SelMed for four additional days, and then dilutions were plated onto YPD (1% yeast extract, 2% peptone, 2% glucose) with 50 μg ml⁻¹ chloramphenicol to retard bacterial growth. All incubation was at 28 °C. Up to 94 colonies were taken from each sample and stored in 15% glycerol at −80 °C. A total of 7144 individuals were isolated from environmental samples. A natural enrichment of the juice samples was performed by allowing them to ferment spontaneously at 15 °C. In all, 100 ml was concentrated by centrifugation after 21 days, and plated on YPD with 50 μg ml⁻¹ chloramphenicol. Again 94 colonies were isolated from each ferment sample, totalling 3572 individuals. All niches were thus evenly sampled and in total 10 716 individuals were collected.

Molecular methods

Genomic DNA was extracted from colonies with 15 μl of 1.25 mg ml⁻¹ Zymolyase solution dissolved in 1.2 M sorbitol and 0.1 M KH₂PO₄ at pH 7.2 and treated with EMA to bind unwanted DNA fragments (Rueckert and Morgan, 2007). We employed a multiplex PCR reaction to distinguish S. cerevisiae, and this also identifies S. uvarum (de Melo Pereira et al., 2010). DNA from eight S. cerevisiae colonies from each sample was initially amplified and scored at 10 unlinked loci as described by Richards et al. (2009) using capillary electrophoresis on an ABI3130XL (Applied Biosystems, Life Technologies, Mulgrave, VIC, Australia). If all eight initial isolates were genotypically identical then no further genotyping was performed for that sample; however, if more than one genotype was recovered, further batches of eight were genotyped until either no new genotypes were seen, or all isolates from the sample had been genotyped. A number of control samples were submitted for the calculation of error rates per allele and per locus as described by Pompanon et al. (2005). To further ascertain the reliability of microsatellite loci amplification and scoring, we analysed an additional 96-well control plate replicating the same strain for DNA extraction, PCR amplification and genotyping.
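The batch-wise stopping rule described above can be sketched in a few lines. This is only an illustrative reading of the text, not the authors' workflow: the function name `genotype_in_batches` and the `genotype_of` callback (standing in for the laboratory genotyping step) are hypothetical.

```python
from typing import Callable, Hashable, Sequence, Set, Tuple


def genotype_in_batches(
    isolates: Sequence[Hashable],
    genotype_of: Callable[[Hashable], Hashable],
    batch_size: int = 8,
) -> Tuple[Set[Hashable], int]:
    """Apply the batch-wise stopping rule to one sample.

    Returns the distinct genotypes found and the number of isolates genotyped.
    `genotype_of` is a hypothetical stand-in for the wet-lab genotyping step.
    """
    seen: Set[Hashable] = set()
    n_genotyped = 0
    for start in range(0, len(isolates), batch_size):
        batch = isolates[start:start + batch_size]
        new = {genotype_of(iso) for iso in batch} - seen
        seen |= new
        n_genotyped += len(batch)
        first_batch = start == 0
        # Stop after the first batch if every isolate was identical.
        if first_batch and len(seen) == 1:
            break
        # Otherwise stop as soon as a later batch adds no new genotype.
        if not first_batch and not new:
            break
    return seen, n_genotyped
```

If the loop never breaks, every isolate from the sample ends up genotyped, which matches the second stopping condition in the text.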

Data analyses

A ±1-bp error in size calling, arising from run-to-run variation and plus-A effects, was observed and loci were binned accordingly using GeneMapper (Version 4). F-statistics, migration estimates (Nm values) and Mantel tests were performed with GenAlEx (Genetic Analyses in Excel) version 6.5 (Peakall and Smouse, 2006, 2012). Estimates of population diversity were calculated by rarefaction (which controls for unequal sample sizes) using EstimateS (Colwell, 2006). The maximum likelihood outcrossing rates were estimated in Mathematica 7 following the method used by Johnson et al. (2004) that estimates the proportion of matings between spores from the same meiotic event (that is, that are asci mates), and those from independent meiotic events (code available at http://goddardlab.auckland.ac.nz/data-and-code/). Allelic richness was estimated using rarefaction with HP-Rare, again controlling for unequal sample sizes, based on the lowest number of 94 observed alleles among sampled populations (Kalinowski, 2005).

Population structure was evaluated using the Bayesian clustering method implemented in InStruct, which does not assume Hardy–Weinberg equilibrium, accounts for inbreeding, and makes no a priori assumptions about the sampling location of the genotypes (Gao et al., 2007). This method estimates the most likely number of populations and assigns genotypes to these probabilistically. Admixture was allowed and the proportion of each genotype’s ancestry in each inferred population was estimated. Three chains of one million MCMC iterations with a burn-in of 10 000 were run for K=1–25. Convergence of the MCMC chains was confirmed using the Gelman–Rubin statistic (Gelman and Rubin, 1992). Analyses of the resulting ancestry profiles, evaluating and quantifying the contribution of niche and geographic region to population structure, were conducted with ObStruct (Gayevskiy et al., 2014).
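For readers unfamiliar with the Gelman–Rubin statistic, the following is a minimal sketch of the basic potential scale reduction factor for a single scalar quantity traced across several chains. It is not InStruct's own implementation, and which quantity was monitored in the study is not stated here; the function name `gelman_rubin` is an illustrative choice.

```python
from statistics import mean, variance
from typing import Sequence


def gelman_rubin(chains: Sequence[Sequence[float]]) -> float:
    """Basic potential scale reduction factor (R-hat) for one scalar trace.

    `chains` holds m >= 2 post-burn-in chains of equal length n >= 2.
    Values close to 1 suggest the chains are sampling the same distribution.
    """
    m = len(chains)
    if m < 2:
        raise ValueError("need at least two chains")
    n = len(chains[0])
    if n < 2 or any(len(c) != n for c in chains):
        raise ValueError("chains must share a common length of at least 2")

    chain_means = [mean(c) for c in chains]
    within = mean([variance(c) for c in chains])   # W: mean within-chain variance
    between = n * variance(chain_means)            # B: between-chain variance
    if within == 0:
        raise ValueError("within-chain variance is zero; R-hat is undefined")

    pooled = (n - 1) / n * within + between / n    # pooled variance estimate
    return (pooled / within) ** 0.5
```

Chains that have settled on different values give a large between-chain term and hence a value well above 1, whereas well-mixed chains give values near 1.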

Directional migration rates were quantified using the Bayesian coalescent approach implemented in MIGRATE that assumes constant population sizes, random mating, a constant mutation rate and that populations are connected only through migration, not population divergence (Beerli and Felsenstein, 2001; Beerli, 2006, 2009; Beerli and Palczewski, 2010). Mutation-scaled population sizes (θ) were calculated using the number of sampled alleles (Haasl and Payseur, 2010). We employed a Brownian motion allele mutation model with starting estimates of the mutation-scaled migration rate derived from F_ST calculations to estimate all possible migration routes. Chains of one million steps with a burn-in of 50 000 were run with 10 replicates, sampling every 100 steps (Beerli, 2009). The analysis was run in parallel on the NeSI pan cluster at the University of Auckland.

Results

S. cerevisiae presence, abundance and genetic diversity

PCR analyses revealed that 3900 (36%) of the 10 716 isolates were S. cerevisiae. Of the 3780 isolates from spontaneous ferments, 2210 (56%) were S. cerevisiae and 1570 (40%) S. uvarum, revealing the co-existence of a sister Saccharomyces species in this niche. Here we do not pursue the population genetics of S. uvarum. S. cerevisiae was detected in 13 of the 37 vineyard soils, and four and one of the 36 native soil and fruit samples, respectively. The breakdown of samples that yielded S. cerevisiae is shown in Supplementary Tables S1 and S2.

From control samples, two loci (YOR267C and YBR240C) amplified unreliably and were removed from all analyses. Overall, the mean error rates per allele and per locus were 4.08% and 4.35%, respectively. In total 850 individuals were genotyped, with 681 isolates from spontaneous ferments, 130 from vineyard soil, 31 from native soil and 8 from native fruits. Identical genotypes within the same sample were collapsed to conservatively account for clonal expansion during enrichment and fermentation, meaning the data set was compressed to 380 genotype profiles. Just 11 genotypes matched commercially available wine strains commonly used in NZ (Richards et al., 2009) and were removed from further analyses. This resulted in a final data set comprising 369 microsatellite profiles (Supplementary Data set S1). Interestingly, no genotypes matched a genetically and ecologically diverse set of international strains (Liti et al., 2009) genotyped using the same method (Richards et al., 2009; Goddard et al., 2010).
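The collapsing step, in which identical genotypes are kept only once per sample, can be illustrated as follows. This is a sketch under assumed data structures: the representation of a record as a `(sample_id, genotype)` pair and the function name `collapse_within_samples` are hypothetical and not taken from the study.

```python
from typing import Hashable, Iterable, List, Sequence, Tuple

Record = Tuple[Hashable, Tuple[int, ...]]


def collapse_within_samples(
    records: Iterable[Tuple[Hashable, Sequence[int]]],
) -> List[Record]:
    """Keep one copy of each multilocus genotype per sample.

    Each record is (sample_id, genotype), where genotype is a sequence of
    allele sizes across loci (an assumed encoding). The same genotype found
    in two different samples is kept in both, in first-seen order.
    """
    seen = set()
    collapsed: List[Record] = []
    for sample_id, genotype in records:
        key = (sample_id, tuple(genotype))
        if key not in seen:
            seen.add(key)
            collapsed.append(key)
    return collapsed
```

Because duplicates are removed only within a sample, a genotype shared between samples or regions still counts once for each sample, which is what allows between-sample matches to be detected later.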

For the entire data set, a large allelic diversity was detected at all loci. YFR028C and YML091C had the greatest diversity with 25 and 30 alleles, respectively, and all other loci had between 11 and 16 alleles. Overall, 295 different genotypes were recovered and only 38 of these were identified in more than one sample. On average samples that yielded S. cerevisiae contained 4.7 unique genotypes, although most alleles were shared between populations (Supplementary Data set S1). Rarefaction analyses (Chao, 1987; Colwell, 2006) estimate that these genotypes were sampled from an underlying NZ population containing ∼1700 different genotypes (with 95% confidence limits of 1159–2486).
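To show how an estimate of unseen genotype richness can arise from the number of rarely observed genotypes, here is a sketch of a classic Chao-type lower-bound estimator. Whether EstimateS was run with exactly this variant is an assumption; the function name `chao_richness` and the input encoding are illustrative only.

```python
from collections import Counter
from typing import Iterable


def chao_richness(occurrences: Iterable[int]) -> float:
    """Classic Chao-type lower bound on total richness.

    `occurrences` gives, for each observed genotype, how many times
    (or in how many samples) it was seen. f1 and f2 are the numbers of
    genotypes seen exactly once and exactly twice.
    """
    counts = [c for c in occurrences if c > 0]
    s_obs = len(counts)
    freq = Counter(counts)
    f1, f2 = freq[1], freq[2]
    if f2 > 0:
        return s_obs + f1 * f1 / (2 * f2)
    # Standard fallback when no genotype was seen exactly twice.
    return s_obs + f1 * (f1 - 1) / 2
```

The estimate grows with the proportion of genotypes seen only once, which is why a data set dominated by single-sample genotypes implies a much larger underlying population than the number of genotypes actually recovered.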

Testing for ecological drivers of population structure

Only four different genotypes were derived from native fruits and soil, and 21 from vineyard soil (Supplementary Tables S1 and S2). This translates to low statistical power to test how the niche of isolation affects population structure. Despite this, observations of identical genotypes between niches within, but not between, regions are striking. For example, genotypes recovered from native soil and fruit in the Martinborough region (Waiohine Gorge) were identical to spontaneous ferment isolates recovered 20 km away in Martinborough vineyards but were not found in other NZ regions; three of the genotypes isolated from vineyard soil in the Wairau Valley were identical to isolates from spontaneous ferments sourced in the same region, with one of these vineyard soil genotypes being identical to an isolate from the spontaneous ferment from the same vineyard. F_ST values between environmental samples (from native soil and fruit and vineyard soil) and spontaneous ferment samples within regions are extremely low (<0.005) and non-significant (P>0.33) with the exception of Martinborough (F_ST=0.046, P=0.003); however, this F_ST value is classed as only representing ‘low’ differentiation (Wright, 1978). There is complete overlap of isolates deriving from all niches in a principal component analysis of genetic distances between genotypes (Supplementary Figure S1) and no significant population differentiation between niches within regions using InStruct and subsequent ObStruct analyses (P>0.119, Supplementary Figure S2). Together, these results provide no substantial evidence of an effect of niche on population structure within regions. Some of the S. cerevisiae genotypes contributing to spontaneous ferments may have derived from wineries, as opposed to the ‘environment’ (Bokulich et al., 2013). As all of these wineries reside within the same geographic regions the fruit was collected from, these potentially winery-derived genotypes form part of the local population we wish to study. Thus, individuals from various niches within regions comprise homogeneous populations, and so we combined all genotypes from different niches within regions to form regional populations for further analyses.

Testing for geographic drivers of population structure

There was significant genetic differentiation, as estimated by pairwise F_ST values, between populations deriving from all six regions (P<0.01), with the exception of those between the Wairau and Awatere Valleys (F_ST=0.001, P=0.310). These two valleys comprise the wider Marlborough region and were thus combined to represent one population residing in Marlborough. The subsequent pairwise F_ST values between regions are shown in Figure 1. A low albeit significant correlation was observed between genetic and geographic distance (Mantel Test: R²=0.181, P<0.001). Population diversity, as estimated by rarefaction analyses to control for uneven numbers of genotypes, differs by as much as threefold between regions (Table 1). Hawke’s Bay and Marlborough harbour the greatest diversity, whereas Nelson and Central Otago harbour the least. Allelic richness is comparable across regions, with estimates within one s.d. of each other (Table 1). All eight loci in all regions are significantly out of Hardy–Weinberg equilibrium (P<0.001), and show strong signals for inbreeding; however, outcrossing rates are significantly above zero within each region (Table 1).
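The logic of a Mantel test, which asks whether genetic distance tracks geographic distance, can be sketched as a simple permutation procedure. This is a generic illustration and not the GenAlEx implementation; the function name `mantel_test`, the default of 999 permutations and the one-sided P-value are assumptions made for the example.

```python
import random
from itertools import combinations
from statistics import mean
from typing import List, Sequence, Tuple

Matrix = Sequence[Sequence[float]]


def _pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation between two equal-length vectors."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def mantel_test(genetic: Matrix, geographic: Matrix,
                permutations: int = 999, seed: int = 0) -> Tuple[float, float]:
    """Return (r, one-sided P) for two symmetric distance matrices.

    Rows and columns of the genetic matrix are permuted together, and P is
    the share of permutations with a correlation at least as large as the
    observed one (the observed value counts as one permutation).
    """
    n = len(genetic)
    pairs = list(combinations(range(n), 2))        # upper triangle only
    geo_vec = [geographic[i][j] for i, j in pairs]

    def r_for(order: List[int]) -> float:
        gen_vec = [genetic[order[i]][order[j]] for i, j in pairs]
        return _pearson(gen_vec, geo_vec)

    observed = r_for(list(range(n)))
    rng = random.Random(seed)
    order = list(range(n))
    hits = 0
    for _ in range(permutations):
        rng.shuffle(order)
        if r_for(order) >= observed:
            hits += 1
    return observed, (hits + 1) / (permutations + 1)
```

The function returns the correlation coefficient r, so squaring it gives a quantity comparable in kind to the R² reported above. Permuting whole rows and columns, rather than individual cells, preserves the dependence among distances that share a population.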

Table 1 Summary of the populations isolated from each region

Full size table

Quantifying geographic population structure

InStruct analyses (Gao et al., 2007) indicate that the optimal number of populations, given the data, is 16. Examination of the ancestry profile plots (Rosenberg, 2004) resulting from this analysis is indicative of population structure by region to some degree (for example, the blocks of green, red and yellow in Nelson, Central Otago and Martinborough, respectively) and is in agreement with the magnitude of the pairwise F_ST estimates (Figure 1). Subsequent ObStruct analyses revealed that the inferred population structure is significantly correlated with geographic location (R²=0.16, P<0.0001), and this explains about one-sixth of the genetic variability observed. Individuals from the Nelson and Central Otago regions contributed the greatest signal to overall population structure, with significant decreases in the R² values observed when these are removed (ΔR²=−0.05 and −0.02, respectively). The R² value remained constant when data from Martinborough were removed but increased when Hawke’s Bay and Marlborough data were independently removed (ΔR²=+0.03 for both). Increases in R² suggest individuals from these regions add noise to any signal for structure (that is, have homogenised rather than localised populations). Further, canonical discriminant analysis shows that ∼80% of the variation in ancestry profiles can be represented with the first and second axes, suggesting that most of the variation can be visualised in these graphical representations of the data (Supplementary Figure S3). Ancestry profiles from Central Otago and Nelson cluster the most discretely in these plots, recapitulating that populations from these regions provide the strongest signals for differentiation. Pairwise comparisons between regions all significantly differ (P<0.001 or P=0.06 between Hawke’s Bay and Marlborough), but the R² values vary from 0.02 to 0.23 (Supplementary Table S3).

Quantifying population connectivity and migration

Pairwise estimates of migration between the regions (Nm values) using classic methods derived from F_ST values (Hartl and Clark, 1997) suggest that Hawke’s Bay and Marlborough are the most connected, closely followed by Marlborough and Martinborough, and Hawke’s Bay and Martinborough (Figure 1). Nelson and Central Otago share the lowest number of migrants with an estimate of just one per generation (Figure 1). MIGRATE analyses showed an acceptance ratio for each parameter ranging from 0.38 to 0.65, and an effective sample size of approximately two million, suggesting that the chain length was sufficient. The autocorrelation between parameters and the prior was high and estimated to be around 0.96 overall, indicating a lack of information in the data. This is reflected in the wide confidence intervals surrounding the estimates (Supplementary Table S4). However, consistent patterns between multiple runs were evident, allowing meaningful estimates of gene flow between regions to be made. Inferred mean rates of movement between regions span an order of magnitude, ranging from 6 to 63 migrants per generation (Figure 1 and Supplementary Table S4), and show differential inward and outward movements for some regions. Correlating with the classic Nm estimates, and the analyses of population structure, Nelson and Central Otago show the greatest isolation with twofold greater rates of outward than inward migration, and show an average of just 51 inward migrants per generation, 3.2-fold less than the overall average inward migration rate of 164 migrants per generation for all other regions (Figure 1 and Supplementary Table S4). Conversely, Marlborough and the Hawke’s Bay, which harbour some of the least distinctive and most diverse populations, experience some of the greatest inward migration rates at an average of 171 migrants per generation, 1.4-fold more than the average inward migration rate (Supplementary Table S4). In line with the low F_ST estimates, a high proportion of individuals with shared ancestry from InStruct, and a large proportion of admixed individuals (Figure 1), Marlborough and the Hawke’s Bay are the most connected regions, and experience an average of twofold more migration between these regions than the average overall migration rate. The extent of migration between regions does not correlate with geographic location (P>0.21), showing that the difference in the extent of gene flow is not simply a function of distance.
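The classic conversion from F_ST to a number of migrants per generation can be written out explicitly. The sketch below assumes the standard island-model relation Nm = (1/F_ST − 1)/4; that this exact form underlies the values in Figure 1 is an assumption, and the function name `nm_from_fst` is illustrative.

```python
def nm_from_fst(fst: float) -> float:
    """Island-model estimate of migrants per generation from F_ST.

    Uses Nm = (1 / F_ST - 1) / 4, which assumes an equilibrium island
    model; F_ST must lie strictly between 0 and 1.
    """
    if not 0.0 < fst < 1.0:
        raise ValueError("F_ST must lie strictly between 0 and 1")
    return (1.0 / fst - 1.0) / 4.0
```

Under this relation Nm falls steeply as F_ST rises: an F_ST of 0.2 would correspond to one migrant per generation, whereas an F_ST of 0.046, the Martinborough between-niche value reported earlier, would correspond to roughly five. The fold differences quoted for the MIGRATE estimates are simple ratios of the reported means, for example 164/51 ≈ 3.2.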

Discussion

We have very few models attempting to generally describe the population biology of microbes. Accurate quantification of short-term population level processes is necessary to understand the likely longer-term evolutionary trajectories of populations (Smadja and Butlin, 2011; Gray and Goddard, 2012), as well as how microbes may interact with other members of the community (Ruxton et al., 2014). We have attempted to make a significant step forward: rather than simply describing this S. cerevisiae population as either structured or not, here we paint a more biologically realistic picture by quantifying the role that geography has in defining structure, and go on to provide quantitative estimates of gene flow between populations residing in different regions.

S. cerevisiae has clearly been isolated many times from managed vineyard ecosystems and ferments of fruit (Lopandic et al., 2008; Liti et al., 2009; Schacherer et al., 2009; Goddard et al., 2010; Gayevskiy and Goddard, 2012; Bokulich et al., 2014). This species is also well reported from native niches in the northern hemisphere (Sniegowski et al., 2002; Wang et al., 2012; Hyma and Fay, 2013), and in the southern hemisphere has been isolated from exotic Quercus species in NZ, and from Nothofagus in Patagonia (Zhang et al., 2010; Libkind et al., 2011). Here we provide the first report of S. cerevisiae from multiple native tree species in the South Pacific region. Overall, this NZ S. cerevisiae metapopulation displays large genetic variance, compounding evidence that NZ harbours a large and diverse population of this species (Goddard et al., 2010; Gayevskiy and Goddard, 2012; Cromie et al., 2013). Within regions, which typically encompass a radius of under 100 km, there is no compelling evidence for genetic differentiation between niches within managed ecosystems nor, more strikingly, between managed and native ecosystems. The lack of genetic differentiation between managed and native ecosystems seen here does not permit us to determine whether vineyards or native forests are the sources or sinks of these populations, just that they are connected. Thus, there appears to be a free flow of individuals between these various niches at subregional scales, supporting previous reports from NZ and the United States of America (Goddard et al., 2010; Hyma and Fay, 2013). The inference of little differentiation between niches at regional scales is in contrast to previous reports showing differentiation between isolates from various ecological niches at global scales (Fay and Benavides, 2005; Legras et al., 2007). One explanation for this is the extent of sample effort within any one population. The studies, including this one, reporting a minor effect of niche examined a large number of individuals from specific more localised populations, and in some sense evaluate ‘ecological scale’ processes: it may be that some strains are less well adapted to various niches and that selection will eventually result in their removal. Studies evaluating strains from different geographic and ecological sources only include a handful of strains from any one specific population and unfortunately tend to confound geographic location with niche, but conclude that niche has a stronger role; in some sense these studies might examine populations where selection has possibly had more time to operate. Perhaps the drivers of population structure differ at different scales. Lastly, it might be that NZ has relatively recently been colonised by only one of the inferred lineages of S. cerevisiae, and this has radiated to all niches. This would also provide a signal for the lack of differentiation between niches. Estimates of the rates of global flux for S. cerevisiae would help disentangle these possibilities.

Whereas populations appear homogeneous within regions, analyses provide compelling evidence for various degrees of genetic differentiation between populations inhabiting major NZ regions. This differentiation is not absolute and there is also a degree of connectivity between regions. This is in line with a previous smaller-scale study with this species that reported both differentiation and connection between regions in the North Island of NZ (Gayevskiy and Goddard, 2012), and this is also echoed at global scales (Liti et al., 2009; Wang et al., 2012). Here all analyses, both classic and more sophisticated Bayesian approaches, converge on the same conclusion. F_ST, Bayesian, ancestry profile and migration analyses show that the populations residing in Nelson and Central Otago are the most distinct and experience the least inward migration. Conversely, Marlborough and Hawke’s Bay have smaller pairwise F_ST values and Bayesian, ancestry profile and migration analyses show that these regions are the most mixed and connected. Marlborough and Hawke’s Bay experience the most inward migration at approximately three times that into Nelson and Central Otago. This is consistent with the higher genetic diversity observed in these regions and implies that they accumulate genetic diversity from around the country.

S. cerevisiae cells and spores are sessile; however, there are a variety of possible vectors that may move this unicellular eukaryote around. S. cerevisiae has been shown to be associated with both wasps and bees and has long been known to be associated with fruit flies (Reuter et al., 2007; Goddard et al., 2010; Stefanini et al., 2012). Recent work provides evidence that certain volatiles released by S. cerevisiae attract Drosophila, and this enhances the likelihood of movement, and potentially facilitates a mutualism between these species (Buser et al., 2014). These insect species easily move over regional scales, and so presumably have some part in the homogenisation of S. cerevisiae within regions. Insects are less likely to move S. cerevisiae over hundreds of kilometres between regions, although S. cerevisiae may also be associated with birds that can easily cover these distances (Francesca et al., 2012). Humans are also obvious vectors. Indeed, the patterns of separation, and rates of migration into and out of the various regions shown here, are nicely in line with the flow of fruit and equipment arising from the actions of the NZ wine industry. Marlborough and the Hawke’s Bay are the two largest viticultural and winemaking regions in the country, and fruit from other regions is often transferred to them, mirroring the inferred migration of S. cerevisiae into these regions. This national ‘ecological’ scale picture complements and mirrors the global ‘evolutionary’ scale picture revealed for this species: that this is a genetically diverse species that shows some degree of structure and connectivity, and these patterns are consistent with human-influenced dispersal (Fay and Benavides, 2005; Legras et al., 2007).

Although the above interpretation fits nicely with the population patterns observed here, it is important to consider alternate explanations. The connections between populations could instead be indicative of recent divergence events. The NZ wine industry is very young in evolutionary terms and it is possible that S. cerevisiae was introduced to these regions via the introduction of vines and winery equipment such as barrels (Goddard et al., 2010). The patterns observed in this analysis could be explained by the large wine-producing regions of Hawke’s Bay and Marlborough being the source of variation and the outlying regions resulting from founder events with subsequent population expansion and divergence (Hartl and Clark, 1997). The method of migration analysis employed here assumes that population divergence has not occurred, and only invokes migration to explain any similarity in genetic diversity between populations (Beerli, 2009). One issue with a divergence (as opposed to migration) explanation is that source populations must exist before the populations they are proposed to have founded. Whereas Hawke’s Bay is one of the oldest wine-producing regions in NZ, Marlborough is one of the youngest having only been established around 1970. Thus, the divergence hypothesis fits less well, given the vast diversity and admixture observed in the recently established Marlborough region. In addition, whilst it appears that the NZ S. cerevisiae population is reasonably internationally distinct, these patterns of differentiation may also be explained by the inward migration of genotypes from offshore. These explanations are not mutually exclusive and it is likely that population divergence from founding populations is occurring alongside inevitable national and international migration of strains because of the vast movement of fruit, equipment and people by the wine industry.

The demonstration that certain regions have ‘signature’ microbial populations is of relevance to the wine industry. It is often suggested that certain wines reflect their geographic origin, and this is encapsulated in the concept of terroir (Bokulich et al., 2014). Classically, this was thought to largely result from the interaction between specific Vitis vinifera varieties and the local soils, geography and climate; however, there is limited but increasing evidence showing that the microbes that influence vine growth, fermentation and wine style (as S. cerevisiae does) also exhibit regional differentiation (Gayevskiy and Goddard, 2012; Bokulich et al., 2014; Taylor et al., 2014), as we again demonstrate here. Thus, these data further support the concept that there could be a microbial aspect to terroir. Metabolic profiling of regionally defined genotypes is necessary to determine whether the genetic differentiation demonstrated here translates to phenotypes that are relevant to wine, and thus whether microbes contribute to terroir in a predictable and consistent way.

Here we provide a more advanced insight into the population biology of a well-established model microbial eukaryote that has also been biotechnologically harnessed by humans since the dawn of civilisation. We take a significant step towards quantifying population processes by providing the first estimates for metapopulation separation and similarity. We reveal S. cerevisiae population differentiation in NZ at scales over 100 km, with most of the signal provided by the more remote regions, but no differentiation within regions, even between populations inhabiting native forests and vineyards. We also show differential migration of this species between regions, and postulate that this may be due, at least in part, to human influence. By quantifying the magnitude of these forces in microbes we begin to provide one crucial aspect of an inclusive framework attempting to more fully integrate ecological and evolutionary processes.

References

Aa E, Townsend JP, Adams RI, Nielsen KM, Taylor JW . (2006). Population structure and gene evolution in Saccharomyces cerevisiae. FEMS Yeast Res 6: 702–715.
Achtman M . (2008). Evolution, population structure, and phylogeography of genetically monomorphic bacterial pathogens. Annu Rev Microbiol 62: 53–70.
Anderson JB, Kohn LM . (1998). Genotyping, gene genealogies and genomics bring fungal population genetics above ground. Trends Ecol Evol 13: 444–449.
Anderson JL, Shearer CA . (2011). Population genetics of the aquatic fungus Tetracladium marchalianum over space and time. PLoS One 6: e15908.
Beerli P . (2006). Comparison of Bayesian and maximum-likelihood inference of population genetic parameters. Bioinformatics 22: 341–345.
Beerli P . (2009). How to use MIGRATE or why are Markov chain Monte Carlo programs difficult to use? In: Bertorelle G, Bruford MW, Hauffe HC, Rizzoli A, Vernesi C (eds), Population Genetics for Animal Conservation. Cambridge University Press: New York, USA, pp 42–79.
Beerli P, Felsenstein J . (2001). Maximum likelihood estimation of a migration matrix and effective population sizes in n subpopulations by using a coalescent approach. Proc Natl Acad Sci USA 98: 4563–4568.
Beerli P, Palczewski M . (2010). Unified framework to evaluate panmixia and migration direction among multiple sampling locations. Genetics 185: 313–326.
Bokulich NA, Ohta M, Richardson PM, Mills DA . (2013). Monitoring seasonal changes in winery-resident microbiota. PLoS One 8: e66437.
Bokulich NA, Thorngate JH, Richardson PM, Mills DA . (2014). Microbial biogeography of wine grapes is conditioned by cultivar, vintage, and climate. Proc Natl Acad Sci USA 111: E139–E148.
Buser CC, Newcomb RD, Gaskett AC, Goddard MR . (2014). Niche construction initiates the evolution of mutualistic interaction. Ecol Lett (in press).
Cavalieri D, McGovern PE, Hartl DL, Mortimer R, Polsinelli M . (2003). Evidence for S. cerevisiae fermentation in ancient wine. J Mol Evol 57 (Suppl 1): S226–S232.
Chambers PJ, Pretorius IS . (2010). Fermenting knowledge: the history of winemaking, science and yeast research. EMBO Rep 11: 914–920.
Chao A . (1987). Estimating the population size for capture-recapture data with unequal catchability. Biometrics 43: 783–791.
Colwell RK . (2006). EstimateS: Statistical estimation of species richness and shared species from samples. www.purl.oclc.org/estimates.
Cromie GA, Hyma KE, Ludlow CL, Garmendia-Torres C, Gilbert TL, May P et al. (2013). Genomic sequence diversity and population structure of Saccharomyces cerevisiae assessed by RAD-seq. G3 (Bethesda) 3: 2163–2171.
de Melo Pereira GV, Ramos CL, Galvão C, Souza Dias E, Schwan RF . (2010). Use of specific PCR primers to identify three important industrial species of Saccharomyces genus: Saccharomyces cerevisiae, Saccharomyces bayanus and Saccharomyces pastorianus. Lett Appl Microbiol 51: 131–137.
Di Maio S, Polizzotto G, Di Gangi E, Foresta G, Genna G, Verzera A et al. (2012). Biodiversity of indigenous Saccharomyces populations from old wineries of south-eastern Sicily (Italy): preservation and economic potential. PLoS One 7: e30428.
Dujon B . (2010). Yeast evolutionary genomics. Nat Rev Genet 11: 512–524.
Fay JC, Benavides JA . (2005). Evidence for domesticated and wild populations of Saccharomyces cerevisiae. PLoS Genet 1: 0066–0071.
Finlay BJ . (2002). Global dispersal of free-living microbial eukaryote species. Science 296: 1061–1063.
Francesca N, Canale DE, Settanni L, Moschetti G . (2012). Dissemination of wine-related yeasts by migratory birds. Environ Microbiol Rep 4: 105–112.
Gao H, Williamson S, Bustamante CD . (2007). A Markov chain Monte Carlo approach for joint inference of population structure and inbreeding rates from multilocus genotype data. Genetics 176: 1635–1651.
Gayevskiy V, Goddard MR . (2012). Geographic delineations of yeast communities and populations associated with vines and wines in New Zealand. ISME J 6: 1281–1290.
Gayevskiy V, Klaere S, Knight S, Goddard MR . (2014). ObStruct: a method to objectively analyse factors driving population structure using bayesian ancestry profiles. PLoS One 9: e85196.
Gelman A, Rubin DB . (1992). Inference from iterative simulation using multiple sequences. Stat Sci 7: 457–472.
Goddard MR . (2008). Quantifying the complexities of Saccharomyces cerevisiae's ecosystem engineering via fermentation. Ecology 89: 2077–2082.
Goddard MR, Anfang N, Tang R, Gardner RC, Jun C . (2010). A distinct population of Saccharomyces cerevisiae in New Zealand: evidence for local dispersal by insects and human-aided global dispersal in oak barrels. Environ Microbiol 12: 63–73.
Gray JC, Goddard MR . (2012). Gene-flow between niches facilitates local adaptation in sexual populations. Ecol Lett 15: 955–962.
Green JL, Bohannan BJM, Whitaker RJ . (2008). Microbial biogeography: from taxonomy to traits. Science 320: 1039–1043.
Haasl RJ, Payseur BA . (2010). The number of alleles at a microsatellite defines the allele frequency spectrum and facilitates fast accurate estimation of θ. Mol Biol Evol 27: 2702–2715.
Halkett F, Simon J-C, Balloux F . (2005). Tackling the population genetics of clonal and partially clonal organisms. Trends Ecol Evol 20: 194–201.
Hanson CA, Fuhrman JA, Horner-Devine MC, Martiny JBH . (2012). Beyond biogeographic patterns: processes shaping the microbial landscape. Nat Rev Microbiol 10: 497–506.
Härnström K, Ellegaard M, Andersen TJ, Godhe A . (2011). Hundred years of genetic structure in a sediment revived diatom population. Proc Natl Acad Sci USA 108: 4252–4257.
Hartl DL, Clark AG . (1997). Principles of Population Genetics, 3rd edn. Sinauer Associates, Inc.: Sunderland, MA, USA.
Hittinger CT . (2013). Saccharomyces diversity and evolution: a budding model genus. Trends Genet 29: 309–317.
Hyma KE, Fay JC . (2013). Mixing of vineyard and oak-tree ecotypes of Saccharomyces cerevisiae in North American vineyards. Mol Ecol 22: 2917–2930.
Johnson LJ, Koufopanou V, Goddard MR, Hetherington R, Schäfer SM, Burt A . (2004). Population genetics of the wild yeast Saccharomyces paradoxus. Genetics 166: 43–52.
Kalinowski ST . (2005). HP-RARE 1.0: A computer program for performing rarefaction on measures of allelic richness. Mol Ecol Notes 5: 187–189.
Legras JL, Merdinoglu D, Cornuet JM, Karst F . (2007). Bread, beer and wine: Saccharomyces cerevisiae diversity reflects human history. Mol Ecol 16: 2091–2102.
Libkind D, Hittinger CT, Valério E, Gonçalves C, Dover J, Johnston M et al. (2011). Microbe domestication and the identification of the wild genetic stock of lager-brewing yeast. Proc Natl Acad Sci USA 108: 14539–14544.
Liti G, Carter DM, Moses AM, Warringer J, Parts L, James SA et al. (2009). Population genomics of domestic and wild yeasts. Nature 458: 337–341.
Lopandic K, Tiefenbrunner W, Gangl H, Mandl K, Berger S, Leitner G et al. (2008). Molecular profiling of yeasts isolated during spontaneous fermentations of Austrian wines. FEMS Yeast Res 8: 1063–1075.
Martiny JBH, Bohannan BJM, Brown JH, Colwell RK, Fuhrman JA, Green JL et al. (2006). Microbial biogeography: putting microorganisms on the map. Nat Rev Microbiol 4: 102–112.
McGovern PE, Voigt MM, Glusker DL, Exner LJ . (1996). Neolithic resinated wine. Nature 381: 480–481.
Mercado L, Sturm ME, Rojo MC, Ciklic I, Martínez C, Combina M . (2011). Biodiversity of Saccharomyces cerevisiae populations in Malbec vineyards from the ‘Zona Alta del Río Mendoza’ region in Argentina. Int J Food Microbiol 151: 319–326.
Mortimer R, Polsinelli M . (1999). On the origins of wine yeast. Res Microbiol 150: 199–204.
Peakall R, Smouse PE . (2006). GENALEX 6: genetic analysis in excel. Population genetic software for teaching and research. Mol Ecol Notes 6: 288–295.
Peakall R, Smouse P . (2012). GenAlEx 6.5: genetic analysis in excel. Population genetic software for teaching and research – an update. Bioinformatics 28: 2537–2539.
Pompanon F, Bonin A, Bellemain E, Taberlet P . (2005). Genotyping errors: causes, consequences and solutions. Nat Rev Genet 6: 847–859.
Pretorius IS . (2000). Tailoring wine yeast for the new millennium: novel approaches to the ancient art of winemaking. Yeast 16: 675–729.
Prosser JI, Bohannan BJM, Curtis TP, Ellis RJ, Firestone MK, Freckleton RP et al. (2007). The role of ecological theory in microbial ecology. Nat Rev Microbiol 5: 384–392.
Reuter M, Bell G, Greig D . (2007). Increased outbreeding in yeast in response to dispersal by an insect vector. Curr Biol 17: R81–R83.
Richards KD, Goddard MR, Gardner RC . (2009). A database of microsatellite genotypes for Saccharomyces cerevisiae. Antonie van Leeuwenhoek 96: 355–359.
Rosenberg NA . (2004). Distruct: a program for the graphical display of population structure. Mol Ecol Notes 4: 137–138.
Rueckert A, Morgan HW . (2007). Removal of contaminating DNA from polymerase chain reaction using ethidium monoazide. J Microbiol Methods 68: 596–600.
Ruxton GD, Wilkinson DM, Schaefer HM, Sherratt TN . (2014). Why fruit rots: theoretical support for Janzen’s theory of microbe–macrobe competition. Proc R Soc B Biol Sci 281: 20133320.
Schacherer J, Shapiro JA, Ruderfer DM, Kruglyak L . (2009). Comprehensive polymorphism survey elucidates population structure of Saccharomyces cerevisiae. Nature 458: 342–345.
Schuller D, Alves H, Dequin S, Casal M . (2005). Ecological survey of Saccharomyces cerevisiae strains from vineyards in the Vinho Verde Region of Portugal. FEMS Microbiol Ecol 51: 167–177.
Serjeant K, Tang R, Anfang N, Beggs JR, Goddard MR . (2008). Yeasts associated with the New Zealand Nothofagus honeydew system. N Z J Ecol 32: 209–213.
Smadja CM, Butlin RK . (2011). A framework for comparing processes of speciation in the presence of gene flow. Mol Ecol 20: 5123–5140.
Sniegowski PD, Dombrowski PG, Fingerman E . (2002). Saccharomyces cerevisiae and Saccharomyces paradoxus coexist in a natural woodland site in North America and display different levels of reproductive isolation from European conspecifics. FEMS Yeast Res 1: 299–306.
Stefanini I, Dapporto L, Legras J-L, Calabretta A, Di Paola M, De Filippo C et al. (2012). Role of social wasps in Saccharomyces cerevisiae ecology and evolution. Proc Natl Acad Sci USA 109: 13398–13403.
Taylor JW, Turner E, Townsend JP, Dettman JR, Jacobson D . (2006). Eukaryotic microbes, species recognition and the geographic limits of species: examples from the kingdom Fungi. Philos Trans R Soc Lond B Biol Sci 361: 1947–1963.
Taylor MW, Tsai P, Anfang N, Ross HA, Goddard MR . (2014). Pyrosequencing reveals regional differences in fruit-associated fungal communities. Environ Microbiol doi:10.1111/1462-2920.12456.
Tsai IJ, Bensasson D, Burt A, Koufopanou V . (2008). Population genomics of the wild yeast Saccharomyces paradoxus: quantifying the life cycle. Proc Natl Acad Sci USA 105: 4957–4962.
Van Der Heijden MGA, Bardgett RD, Van Straalen NM . (2008). The unseen majority: soil microbes as drivers of plant diversity and productivity in terrestrial ecosystems. Ecol Lett 11: 296–310.
Wang Q-M, Liu W-Q, Liti G, Wang S-A, Bai F-Y . (2012). Surprisingly diverged populations of Saccharomyces cerevisiae in natural environments remote from human activity. Mol Ecol 21: 5404–5417.
Whitaker RJ, Banfield JF . (2006). Population genomics in natural microbial communities. Trends Ecol Evol 21: 508–516.
Wright S . (1978). Evolution and the Genetics of Populations, Vol. 4: Variability within and among Natural Populations. University of Chicago Press: Chicago.
Xufre A, Albergaria H, Inácio J, Spencer-Martins I, Gírio F . (2006). Application of fluorescence in situ hybridisation (FISH) to the analysis of yeast population dynamics in winery and laboratory grape must fermentations. Int J Food Microbiol 108: 376–384.
Zhang H, Skelton A, Gardner RC, Goddard MR . (2010). Saccharomyces paradoxus and Saccharomyces cerevisiae reside on oak trees in New Zealand: evidence for migration from Europe and interspecies hybrids. FEMS Yeast Res 10: 941–947.

Acknowledgements

We thank Amisfield, Ata Rangi, Churton, Coal Pit, Constellation, Delegats, Domain Road, Frey Vineyard, Huia, Misha’s Vineyard, Mt Difficulty, Mt Riley, Neudorf, Palliser, Pernod Ricard, Rippon, Seifried, Seresin, Te Kairanga, Te Whare Ra, Tohu, Trinity Hill, Villa Maria and Vita Brevis for access to their land and providing samples. Soon Lee, Peter Morrison-Whittle and Velimir Gayevskiy assisted with sample collection and processing. We also acknowledge the contribution of the NeSI high-performance computing facilities at the University of Auckland and the staff, particularly Jordi Blasco, at NeSI and Centre for eResearch. New Zealand's national facilities are provided by the New Zealand eScience Infrastructure (NeSI) and funded jointly by NeSI's collaborator institutions and through the Ministry of Business, Innovation and Employment's Infrastructure programme. This work was funded by a University of Auckland doctoral scholarship to SK and grants to MG from the New Zealand Ministry of Business, Innovation and Employment, New Zealand Winegrowers and Plant and Food Research.

Author information

Authors and Affiliations

School of Biological Sciences, University of Auckland, Auckland, New Zealand
Sarah Knight & Matthew R Goddard

Corresponding author

Correspondence to Sarah Knight.

Ethics declarations

Competing interests

The authors declare no conflict of interest.

Additional information

Supplementary Information accompanies this paper on The ISME Journal website

Supplementary information

Supplementary Information (DOC 25 kb)

Supplementary Figure S1 (JPG 226 kb)

Supplementary Figure S2 (JPG 490 kb)

Supplementary Figure S3 (JPG 467 kb)

Supplementary Table S1 (DOC 132 kb)

Supplementary Table S2 (DOC 138 kb)

Supplementary Table S3 (DOC 28 kb)

Supplementary Table S4 (DOC 30 kb)

Supplementary Dataset S1 (XLS 95 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/

About this article

Cite this article

Knight, S., Goddard, M. Quantifying separation and similarity in a Saccharomyces cerevisiae metapopulation. ISME J 9, 361–370 (2015). https://doi.org/10.1038/ismej.2014.132

Received: 06 April 2014
Revised: 03 June 2014
Accepted: 11 June 2014
Published: 25 July 2014
Issue Date: February 2015
DOI: https://doi.org/10.1038/ismej.2014.132
