Quantifying community assembly processes and identifying features that impose them

Stegen, James C; Lin, Xueju; Fredrickson, Jim K; Chen, Xingyuan; Kennedy, David W; Murray, Christopher J; Rockhold, Mark L; Konopka, Allan

doi:10.1038/ismej.2013.93

Download PDF

Original Article
Published: 06 June 2013

Microbial population and community ecology

Quantifying community assembly processes and identifying features that impose them

James C Stegen¹,
Xueju Lin^1,2,
Jim K Fredrickson¹,
Xingyuan Chen³,
David W Kennedy¹,
Christopher J Murray⁴,
Mark L Rockhold³ &
…
Allan Konopka¹

The ISME Journal volume 7, pages 2069–2079 (2013)Cite this article

26k Accesses
1160 Citations
18 Altmetric
Metrics details

Subjects

Abstract

Spatial turnover in the composition of biological communities is governed by (ecological) Drift, Selection and Dispersal. Commonly applied statistical tools cannot quantitatively estimate these processes, nor identify abiotic features that impose these processes. For interrogation of subsurface microbial communities distributed across two geologically distinct formations of the unconfined aquifer underlying the Hanford Site in southeastern Washington State, we developed an analytical framework that advances ecological understanding in two primary ways. First, we quantitatively estimate influences of Drift, Selection and Dispersal. Second, ecological patterns are used to characterize measured and unmeasured abiotic variables that impose Selection or that result in low levels of Dispersal. We find that (i) Drift alone consistently governs ∼25% of spatial turnover in community composition; (ii) in deeper, finer-grained sediments, Selection is strong (governing ∼60% of turnover), being imposed by an unmeasured but spatially structured environmental variable; (iii) in shallower, coarser-grained sediments, Selection is weaker (governing ∼30% of turnover), being imposed by vertically and horizontally structured hydrological factors;(iv) low levels of Dispersal can govern nearly 30% of turnover and be caused primarily by spatial isolation resulting from limited exchange between finer and coarser-grain sediments; and (v) highly permeable sediments are associated with high levels of Dispersal that homogenize community composition and govern over 20% of turnover. We further show that our framework provides inferences that cannot be achieved using preexisting approaches, and suggest that their broad application will facilitate a unified understanding of microbial communities.

Ecosystem size-induced environmental fluctuations affect the temporal dynamics of community assembly mechanisms

Article Open access 18 August 2022

Environmental stress mediates groundwater microbial community assembly

Article 11 January 2024

Environmental heterogeneity determines the ecological processes that govern bacterial metacommunity assembly in a floodplain river system

Article 27 July 2020

Introduction

‘Actual ecological communities are undoubtedly governed by both niche-assembly and dispersal-assembly rules, along with ecological drift, but the important question is: what is their relative quantitative importance?’ (Hubbell, 2001)

Across microbial community ecology, there are many examples of niche-based processes strongly influencing community composition (for example, Gilbert et al., 2012), whereas other studies support neutral or stochastic community assembly (for example, Ofiteru et al., 2010). Clearly, knowledge gained from these and many conceptually similar studies is vital for understanding each interrogated system. Less clear is how to build from this body of work to achieve a more unified understanding of processes that govern the composition of microbial communities.

We suggest one path forward is to work towards realizing Hubbell’s (2001) vision, as summarized above, such that relative process influences can be quantified and compared across microbial systems. To do so, we work within Vellend’s (2010) conceptual framework, which is focused on the influences of Selection, Dispersal, Drift and Speciation. Selection is the result of biotic and abiotic pressures causing variation in reproductive success across individuals and species; Dispersal governs the degree to which individuals move among communities; Drift results from population sizes fluctuating due to chance events; and Speciation can cause differences in species richness among sets of communities that do not exchange individuals through dispersal. On the other hand, Speciation should have little influence within a set of communities where individuals disperse among local communities, known as a ‘metacommunity’ (Leibold et al., 2004). Turnover in community composition within a metacommunity is therefore governed by a combination of Selection, Dispersal and Drift.

Within a metacommunity, the magnitude of Dispersal can range from very limited to very high levels of exchange between communities. Low levels of Dispersal constrain the exchange of organisms among local communities, which can lead to spatial turnover in composition; we refer to this scenario as ‘Dispersal Limitation.’ Dispersal Limitation alone, however, is not enough to cause spatial turnover in composition. Limited exchange of organisms among local communities allows the composition of ecological communities to diverge through stochastic changes in local population sizes. That is, Dispersal Limitation allows Drift to cause much greater spatial turnover in community composition than when Drift acts alone (Hubbell, 2001). On the other hand, high levels of Dispersal can homogenize community composition, thereby causing little turnover in composition (Mouquet and Loreau, 2003; Leibold et al., 2004); we refer to this scenario as ‘Homogenizing Dispersal.’ We note that our concept of Homogenizing Dispersal is similar to ‘mass effects’ and ‘source-sink dynamics’, but we avoid these terms as they invoke additional assumptions and processes (see Leibold et al., 2004); Homogenizing Dispersal simply indicates that dispersal is high enough to cause low turnover by overwhelming other processes.

Quantitatively estimating the influences of Selection, Dispersal Limitation acting in concert with Drift, Drift acting alone and Homogenizing Dispersal is fundamental to our understanding of ecological systems. Such estimates have not, however, been achieved. Instead, recent studies have made progress towards characterizing gradients in the influence of Selection (Chase, 2010; Kraft et al., 2011) and testing ecological neutral theory (Ofiteru et al., 2010; Ricklefs and Renner, 2012). These studies provide important insights, but continued progress requires that we characterize how multiple processes simultaneously govern ecological systems (Gravel et al., 2006; Adler et al., 2007; Vellend, 2010; Stegen and Hurlbert, 2011).

Previous work attempts to characterize the simultaneous influences of ecological processes by partitioning variation in community composition into a fraction explained by environmental variables and a fraction explained by spatial variables (Tuomisto et al., 2003; Cottenie, 2005; Legendre et al., 2009). However, this technique cannot be used to infer the influences of Selection, Dispersal or Drift (Legendre et al., 2009; Gilbert and Bennett, 2010; Jacobson and Peres-Neto, 2010; Smith and Lundholm, 2010; Anderson et al., 2011; Stegen and Hurlbert, 2011).

One limitation of standard analyses that relate ecological community composition to environmental and/or spatial variables is that one must decide a priori which variables are potentially associated with Selection and which potentially result in Dispersal Limitation. For example, we may assume that environmental changes associated with increasing subsurface depth impose Selection on microbial communities. However, there may be unknown hydrological barriers that strongly influence composition by spatially isolating communities. The framework developed here distinguishes between such scenarios by reversing the standard direction of inference. Instead of making a priori decisions, we use ecological patterns to identify which environmental and spatial aspects of our study system impose Selection and which impose Dispersal Limitation.

Our framework relies in part on null models (that is, randomizations) (for example, Chase et al., 2011; Stegen et al., 2012) to identify features that impose Selection or Dispersal Limitation and to quantitatively estimate the influences of Selection, Dispersal Limitation acting alongside Drift, Drift acting alone and Homogenizing Dispersal. Our framework characterizes the spatial structure of both measured and unmeasured environmental variables that impose Selection. In turn, abiotic features that impose Selection can be rigorously distinguished from those that impose Dispersal Limitation. This is true even if key features have not been measured in the field and even if measured environmental variables are related to unknown dispersal barriers, as in the example above.

We apply our analytical framework to subsurface sediments collected from both the Hanford and Ringold geological formations within an unconfined aquifer in southeastern Washington State. These two formations have distinct physical structure, mineralogical composition and geological history (Figure 1; Bjornstad et al., 2009). By comparing inferences across formations and spatial scales, we link ecological processes to geological processes that govern the structure of physical environments.

Materials and methods

We study a bacterial metacommunity associated with subsurface sediments within the unconfined aquifer ∼250 m from the Columbia River in the 300 Area of the Hanford Site in Richland, WA (Figure 1). Our system is characterized by two geological formations; the saturated zone of the coarse-grained Hanford formation ranges from approximately 10 m to 17 m below ground surface and below is the Ringold with finer-grained sediments (Bjornstad et al., 2009). DNA was extracted from sediments as in Lin et al. (2012a), and the V1–V2 region of the 16S rRNA gene was PCR amplified with primers 27F and 338R before pyrosequencing as in Lin et al. (2012b), and processed using QIIME (Caporaso et al., 2010) whereby sequences were clustered as operational taxonomic units (OTUs) defined by 97% sequence similarity (see Supplementary Material). All statistical analyses were carried out in R (R-Core-Team, 2012). Environmental data included a sample’s elevation, horizontal distance from the Columbia River, the elevation of the top of the Ringold formation at its geographic location (Bjornstad et al., 2009) and its percent mud (see Supplementary Material).

Analytical framework development

Turnover in phylogenetic community composition

To infer ecological processes, our analytical framework relies, in part, on phylogenetic turnover, which is the evolutionary distance separating OTUs found in one community from OTUs found in a second community (Graham and Fine, 2008; Stegen et al., 2012). Using phylogenetic turnover to infer ecological processes requires ‘phylogenetic signal’ in OTUs’ optimal habitat conditions (Kraft et al., 2007; Cavender-Bares et al., 2009; Fine and Kembel, 2011), whereby habitat preferences of closely related taxa are more similar to each other than to the habitat preferences of distant relatives (Losos, 2008). We tested for phylogenetic signal to determine whether we could use phylogenetic turnover to make ecological inferences in our system, and to determine the most appropriate metric of phylogenetic turnover.

We found significant phylogenetic signal, but only across relatively short phylogenetic distances (Figure 2), consistent with previous work (Andersson et al., 2010; Diniz-Filho et al., 2010; Hardy et al., 2012; Stegen et al., 2012). It is therefore most appropriate to quantify phylogenetic turnover among closest relatives (Stegen et al., 2012). For this reason, we use the between-community version of the (abundance-weighted) β-mean-nearest taxon distance (βMNTD) (Fine and Kembel, 2011; Webb et al., 2011). βMNTD quantifies the phylogenetic distance between each OTU in one community (k) and its closest relative in a second community (m):

where is the relative abundance of OTU i in community k, n_k is the number of OTUs in k and is the minimum phylogenetic distance between OTU i in community k and all OTUs j in community m. βMNTD was calculated using R function ‘comdistnt’ (abundance.weighted=TRUE; package ‘picante’).

βMNTD can be less than, greater than or equal to the degree of turnover expected when Selection does not influence turnover in community composition. Lower than expected βMNTD should result from environmental conditions constraining community composition by imposing Selection on OTUs. Greater than expected βMNTD should be due to divergent environmental conditions causing each community to be composed of an ecologically distinct set of OTUs.

These expectations assume at least a minor degree of organismal exchange among local communities through deep evolutionary time so that individual communities do not evolve evolutionarily distinct assemblages in situ. This assumption is likely upheld in our system, which is within a single unconfined aquifer (maximum of 54 m separating any two communities) through which groundwater continuously flows and into which the Columbia River annually intrudes (Peterson et al., 2008; Lin et al., 2012b). The degree to which βMNTD deviates from a null model expectation therefore measures the degree to which community composition is limited by Selection on OTU ecological niches.

To quantify the degree to which βMNTD deviates from a null model expectation, we used a randomization that shuffled species names and abundances across the tips of the phylogeny (see Supplementary Material for phylogeny inference methods). After shuffling, βMNTD was recalculated to provide a null value, and repeating the randomization 999 times provided a null distribution. The difference between observed βMNTD and the mean of the null distribution was measured in units of s.d. (of the null distribution) and is referred to as the β-nearest taxon index (βNTI). βNTI values <−2 or >+2 indicate significantly less than or greater than expected phylogenetic turnover, respectively (see also Stegen et al., 2012).

Turnover in OTU composition

Most metrics of turnover in OTU composition provide no information on whether the observed degree of turnover deviates from that expected if community assembly was governed primarily by Drift. One exception is Raup–Crick (Chase et al., 2011). Raup–Crick does not account for OTU relative abundances, however, which carry information useful for understanding ecological processes (Anderson et al., 2011). Here we extend Raup–Crick to consider OTU relative abundances by modifying the procedure of Chase et al. (2011). In short, local communities were assembled probabilistically, where the probability of observing an individual of a given OTU was related to the number of communities occupied by the OTU and the OTU’s relative abundance across all sampled communities. Observed OTU richness and number of individuals were maintained for each community (see Supplementary Material). For a given pair of communities, each was probabilistically assembled 999 times. For each iteration Bray–Curtis dissimilarity was used to quantify compositional turnover, thereby generating a null distribution of Bray–Curtis values. Similar to Chase et al. (2011), we standardize the deviation between empirically observed Bray–Curtis and the null distribution to vary between −1 and +1, and refer to the resulting metric as RC_bray.

We interpret RC_bray values >+0.95 or <−0.95 as significant departures from the degree of turnover expected when Drift acts alone (Chase et al., 2011). In turn, |RC_bray|>0.95 indicates that turnover in community composition is governed primarily by Selection, Dispersal Limitation acting in concert with Drift or Homogenizing Dispersal; RC_bray values between −0.95 and +0.95 are consistent with Drift acting alone (Chase et al., 2011).

We suggest that Dispersal Limitation acting alone should not lead to a significant RC_bray value. For example, consider one homogenous community that is split into two communities with no dispersal between them. For compositional differences to emerge, OTU-specific birth and death rates must differ between the communities such that OTU population sizes differ between the communities. In this case, it is unclear how Dispersal Limitation alone could cause between-community differences in OTU birth and death rates. Drift, however, results from stochastic differences in birth and death rates. If one allows for Drift to occur alongside Dispersal Limitation, pairwise difference in community composition should grow through time and eventually lead to RC_bray>+0.95.

On the other hand, Homogenizing Dispersal may cause less than expected turnover in OTU composition. The expected degree of turnover results from stochastic assembly of local communities by drawing individuals from the regional pool of OTUs (see above). When dispersal between a pair of communities is very high, however, local community assembly is not governed by the composition of the regional pool. For example, take a community that continuously sends large numbers of individuals to a second community. If Selection is relatively weak, the composition of the second community will be determined by the composition of the first community, instead of being determined by the regional pool. Such a scenario should lead to less turnover than when both communities are assembled from the regional pool; that is, RC_bray<−0.95.

Estimating influences of ecological processes

We aim to quantitatively estimate the degree to which spatial turnover in community composition is influenced by Selection, Drift acting alone, Dispersal Limitation acting in concert with Drift and Homogenizing Dispersal. To do so, we take advantageof (i) assuming some dispersal among communities across evolutionary time, non-random phylogenetic turnover arises from Selection (Hardy, 2008) and(ii) non-random turnover in OTU compositioncan result from Selection or Dispersal Limitation (Chase et al., 2011), or as discussed above,from Homogenizing Dispersal. We note thatour framework assumes that all sources of error have a roughly equivalent influence over the quantitative estimates of each process, whereby our estimates should be reasonably close to the true values.

To estimate process influences, we follow a two-step procedure (Figure 3). First, we quantified βNTI for all pairwise community comparisons. As discussed above, a value of |βNTI|>2 indicates that observed turnover between a pair of communities is governed primarily by Selection. In turn, the influence of Selection across a set of local communities was estimated as the fraction of pairwise community comparisons with |βNTI|>2. As a corollary, pairwise comparisons with |βNTI|<2 should be governed by Drift acting alone, Dispersal Limitation acting alongside Drift or Homogenizing Dispersal.

The second step in our procedure quantified RC_bray for pairwise community comparisons that were not governed by Selection (that is, those with |βNTI|<2). Within this set, Dispersal Limitation coupled with Drift should lead to greater than expected turnover (RC_bray>+0.95), whereas Homogenizing Dispersal should lead to less than expected turnover (RC_bray<−0.95). As such, we divided the number of pairwise comparisons with |βNTI|<2 and RC_bray>+0.95 by the total number of all pairwise comparisons. The resulting fraction estimates the influence of Dispersal Limitation acting in concert with Drift. The fraction of all pairwise comparisons with |βNTI|<2 and RC_bray<−0.95 was taken as an estimate for the influence of Homogenizing Dispersal. The fraction of all pairwise comparisons with |βNTI|<2 and |RC_bray|<0.95 estimates the influence of Drift acting alone.

Combining spatial eigenvectors and measured abiotic variables with model selection

In addition to estimating influences of ecological processes, we aim to characterize system features that impose Selection and Dispersal Limitation. To this end, we described spatial and environmental relationships among local communities by combining spatial eigenvector analysis with measured abiotic variables. Spatial eigenvectors describe spatial relationships among communities across a range of spatial scales; the first eigenvector breaks sampling locations into broadly distributed clusters, and subsequent eigenvectors characterize spatial relationships at increasingly fine scales (Borcard and Legendre, 2002; Borcard et al., 2011; Heino et al., 2011).

For spatial eigenvector analyses, we used the R function ‘pcnm’ within package ‘vegan’. The ‘pcnm’ function takes a spatial distance matrix as input. For analyses within the Ringold and Hanford formations, we used geographical locations (Eastings and Northings, Supplementary Table S1) of each well to build the distance matrix, thereby describing spatial relationships in two dimensions. For analyses across both formations (the ‘full system’), we described spatial distances in three dimensions due to increased vertical distances among communities. These three-dimensional Euclidean distances were used to define spatial eigenvectors. Note that spatial eigenvector analysis is robust in one, two or three dimensions (Borcard and Legendre, 2002).

Spatial eigenvectors only describe spatial relationships among sampling locations. As such, some eigenvectors may describe the spatial scales at which dispersal operates, whereas others may be related to the spatial structure of environmental variables (Legendre et al., 2009). In addition to spatial relationships we measured four abiotic variables. However, these measured variables may also simply describe spatial relationships among communities. For example, horizontal distance from the Columbia River may reflect spatial relationships or may reflect different environmental conditions related to spatially structured river water intrusion (Lin et al., 2012b; Stegen et al., 2012). In addition, measured abiotic variables may co-vary with each other and/or with spatial eigenvectors.

To combine all variables and minimize co-variation, we combined measured abiotic variables with spatial eigenvectors using principal components analysis (PCA). The resulting PCA axes (Supplementary Tables S2–S4) were used as independent variables in a model-selection procedure with either βNTI or RC_bray as the dependent variable. Note that three separate sets of PCA axes were characterized: one for the Hanford formation, one for the Ringold formation and one for the full system (Hanford and Ringold formations combined). Labels associated with Hanford formation PCA axes have no relationship to, for example, labels of Ringold formation axes.

To identify features of the system that impose Selection or Dispersal Limitation, we fit statistical models to βNTI and RC_bray using distance-based redundancy analysis (Legendre and Anderson, 1999) (R function ‘capscale’ within package ‘vegan’) combined with a model-selection procedure.Distance-based redundancy analysis takes positive, pairwise community distances as input such that βNTI and RC_bray were each normalized to vary between 0 and 1 before model selection; for each, the absolute magnitude of the minimum (negative) value was added to all values (making all ⩾0), and the resulting values were then divided by their maximum (making all ⩾0 and ⩽1). We used forward model selection (Blanchet et al., 2008) where independent variable significance (α=0.05) was evaluated stepwise and the order of variable evaluation was based on improvement in the model’s adjusted R². Model selection proceeded until the next independent variable was nonsignificant as determined by 1000 permutations (R function ‘ordiR2step’ within package ‘vegan’). Separate model-selection procedures were carried out for the Hanford, the Ringold and the full system, and βNTI and RC_bray were evaluated separately.

The magnitude of βNTI is governed by the influence of Selection relative to the influences of Dispersal Limitation and Drift. Any PCA axes that explain a significant fraction of variation in βNTI should therefore reflect one or more environmental variables that impose Selection. This is true even if a significant PCA axis is unrelated to measured abiotic variables.

If a given PCA axis is significant for βNTI but measured abiotic variables do not load onto it, we consider this PCA axis to be an unmeasured, spatially structured environmental variable that imposes Selection. If measured abiotic variables load heavily onto a significant PCA axis, we consider the axis to be a measured environmental variable that imposes Selection. Furthermore, all PCA axes nonsignificant for βNTI were considered to primarily characterize spatial relationships among communities. This is true even if measured abiotic variables load heavily; measuring a given abiotic variable does not indicate that the variable imposes Selection.

Before RC_bray model selection, we used the βNTI model-selection results to characterize each PCA axis as an unmeasured environmental variable, a measured environmental variable or a spatial variable. Following RC_bray model selection, these variable designations were used (in conjunction with PCA loadings) to interpret the factors imposing Selection or Dispersal Limitation. For example, if a given variable (that is, PCA axis) was not related to βNTI, it was concluded that this variable characterized spatial relationships among local communities. If this same variable was significantly related to RC_bray values, it was identified as characterizing features of the system that impose Dispersal Limitation. To determine if any measured features impose Dispersal Limitation, the PCA loadings on the selected variable were examined.

Comparison of inferences with those from preexisting approaches

We compared insights derived from our analytical framework with those derived from a preexisting approach (similar to, for example, Legendre et al., 2009; Heino et al., 2011). To achieve a direct comparison with our approach, we used the same PCA axes with the same model-selection procedure described above, but with Bray–Curtis dissimilarity as the dependent variable.

Results and Discussion

Quantitative process estimates

Here we provide the first quantitative parsing of ecological processes that influence community assembly (Figure 4a). Across formations and spatial scales, we find that ∼33–57% of turnover in community composition is primarily due to Selection, ∼13–28% of turnover is primarily due to Dispersal Limitation acting in concert with Drift, ∼0–21% of turnover is primarily due to Homogenizing Dispersal and ∼22–29% of turnover is primarily due to Drift acting alone (see Figure 4a for specifics). Preexisting approaches provide no process estimates (Figure 4b).

Our quantitative results are consistent with qualitative conclusions from previous work showing that Selection often has some detectable influence over microbial communities (Andersson et al., 2010; Ofiteru et al., 2010; Stegen et al., 2012). However, we also find that Dispersal Limitation acting in concert with Drift can have a substantial influence over community composition, in contrast to the classic paradigm that ‘all microbes are everywhere’ (see de Wit and Bouvier, 2006; Martiny et al., 2006). This result adds to a growing literature showing an important influence of Dispersal Limitation in microbial systems (for example, Dumbrell et al., 2010; Martiny et al., 2011).

At the other end of the dispersal continuum, Homogenizing Dispersal has a strong influence on community structure in the Hanford formation, but effectively no influence in the Ringold. These contrasting influences of Homogenizing Dispersal make conceptual sense, given the hydrological characteristics of the two formations; in the highly permeable Hanford formation, between-community Dispersal appears to be so high that community composition is often determined primarily by immigration; Dispersal is sufficiently low in the Ringold formation, such that community composition is not strongly influenced by immigration.

We expected to observe a decreased influence of Drift acting alone when considering both formations simultaneously; the larger spatial extent of the system as a whole may increase Dispersal Limitation, and the greater range in environmental conditions may increase the influence of Selection.In contrast to this expectation, Drift alone consistently accounted for ∼25% of turnover in community composition (Figure 4a). It is difficult to compare this result to prior work; to the best of our knowledge, the influence of Drift has neverbeen quantitatively estimated, although Drift is known to have some detectable influence over community assembly in microbial (for example, Ofiteru et al., 2010) and macroorganism (for example, Chase, 2010) systems.

Factors that impose Selection

In addition to quantifying ecological processes, it has long been a goal in ecology to characterize factors that impose Selection (for example, Whittaker, 1967). Previous work, however, has been plagued by the impossibility of measuring all influential environmental variables (Anderson et al., 2011). We overcame this obstacle by running model selection on βNTI, which showed that unmeasured and measured environmental variables impose Selection and that the identity of influential variables changes across formations and spatial scales (Table 1).

Table 1 Summary of model selection results across formations using βNTI and RC_bray

Full size table

In the Ringold formation, βNTI model selection identified one significant PCA axis (PCA7). No measured abiotic variables loaded onto PCA7 (Supplementary Table S2). PCA7 is therefore an unmeasured, spatially structured environmental variable that imposes Selection. The spatial structure of this unmeasured variable is shown in Figure 5. Importantly, model selection for Bray–Curtis identified no significant PCA axes. Relying on preexisting approaches would have therefore provided essentially no information on ecological processes even though the system is heavily governed by Selection (Figure 4).

In the shallower Hanford formation, βNTI model selection identified PCA1 and PCA3 as significant axes. The strongest loadings on PCA1 and PCA3 were distance-from-the-river and subsurface elevation, respectively (Supplementary Table S3). The hydrology of the Hanford formation is strongly influenced by elevation fluctuations of the Columbia River, and the strong loadings of distance-from-the-river and subsurface elevation on PCA1 and PCA3 suggest an important influence of river elevation fluctuations. In particular, the spring runoff-associated river-elevation increase causes water-table rise and intrusion of river water into the subsurface at our site (Peterson et al., 2008; Lin et al., 2012b). Microbial communities near the top of the aquifer may therefore experience saturated or unsaturated hydrological conditions depending on the time of year, and communities further from the river likely experience decreased and less-frequent river intrusion (Lin et al., 2012b). In turn, we hypothesize that in the Hanford formation, Selection causes turnover in community composition due to (i) vertically structured differences in the temporal dynamics of saturation states and (ii) horizontally structured differences in river intrusion. We note that a preexisting approach also selected PCA1 and PCA3 as significant variables, but given the strong spatial structure of these variables, such approaches cannot evaluate whether they impose Selection or Dispersal Limitation.

From analysis of the two formations together (‘full system’), model selection identified three PCA axes associated with βNTI. Examining the loadings of measured abiotic variables on these PCA axes (Supplementary Table S4) suggests that Selection is imposed by factors associated with elevation, such as the percent mud within sediments (Table 1, Supplementary Table S4) and (potentially) between-formation differences in mineralogical composition, age, cementation and vertical layering. In particular, the Ringold is far older (∼ 8.5–3.4 million years old) than the Hanford formation (∼0.015 million years old) with greater cementation and vertical layering (Bjornstad et al., 2009). In addition, measured abiotic variables did not load onto one selected PCA axis, PCA19, suggesting that this axis represents an unmeasured environmental variable that imposes Selection across the full system.

Coupling quantitative process estimates with the βNTI model selection contributes to a system-level conceptual model that contrasts sharply with that derived using a preexisting approach (Figure 4). Our framework suggests that (i) low-energy deposition of fine-grained sediments (as in the Ringold (Bjornstad et al., 2009)) leads to very strong Selection (governing ∼60% of turnover) imposed by an unmeasured, spatially structured environmental variable (Table 1, Supplementary Table S2, Figure 5); (ii) high-energy deposition of coarse-grained sediments (as in the Hanford (Bjornstad et al., 2009)) partially homogenizes abiotic conditions leading to weaker Selection (governing ∼30% of turnover) imposed by spatially structured, hydrology-related environmental factors (Table 1, Supplementary Table S3); and (iii) differences in the physical energy of geological depositional processes can result in between-formation environmental differences that cause turnover due to Selection (governing ∼40% of turnover) (Table 1, Supplementary Table S4). In the case of our particular system, the primary between-habitat environmental differences are related to sediment composition and the degree of vertical layering (Bjornstad et al., 2009); Ringold and Hanford sediments are ∼90% and ∼4% mud, respectively (Supplementary Table S1), and the Ringold has more vertical layering (Bjornstad et al., 2009).

Factors imposing Dispersal Limitation

Fundamental to our understanding of ecological communities is knowledge of the factors that impose Dispersal Limitation. In non-microbial systems, Dispersal Limitation is common, but further inferences are usually limited to the spatial scales across which Dispersal Limitation operates (for example, Legendre et al., 2009). In contrast, we couple model selection for RC_bray with βNTI-basedcharacterization of PCA axes to enable characterization of abiotic features that impose Dispersal Limitation. Key to our approach is that variation in RC_bray can be driven by variation in the strength of Selection or by variation in the magnitude of Dispersal (Chase et al., 2011). PCA axes retained in RC_bray model selection that were not retained in βNTI model selection therefore represent among-community spatial relationships across which dispersal varies (that is, across which Dispersal Limitation is imposed). Inferences drawn from this approach contribute critical elements to our conceptual model (Figure 4a).

At the ‘full-system’ scale, model selection for RC_bray suggests that Dispersal Limitation is imposed, in part, by vertical separation among communities (Table 1, Supplementary Table S4). This is consistent with previous hydrological characterization, suggesting that the fine-grained composition of the Ringold restricts vertical exchange of water between the Ringold and upper Hanford formations (Bjornstad et al., 2009). The disparate geological history of the two formations is therefore indirectly responsible for strong Dispersal Limitation and Selection at the ‘full-system’ scale. Model-selection results further suggest that Dispersal Limitation is also related to horizontal distance from the Columbia River (Table 1, Supplementary Table S4); decreased and less-frequent river intrusion into communities further from the river (Lin et al., 2012b) may therefore cause additional isolation. There also appears to be a number of important unmeasured factors (Table 1, Supplementary Table S4), suggesting that spatially complex hydrological flow paths may strongly influence patterns of organismal exchange among local communities across the ‘full system.’

Within the Hanford formation, RC_bray model selection identified one PCA axis that was not related to βNTI and onto which no measured abiotic variables loaded (Table 1, Supplementary Table S3). This suggests that an unmeasured feature of the Hanford formation imposes Dispersal Limitation. From the available data, it is impossible to know the identity of this unmeasured feature, but as for the ‘full system,’ we hypothesize that spatially structured hydrological flow paths influence the degree to which local communities exchange organisms.

Conclusions

Inferences drawn across our analytical framework provide a unique conceptual model (Figure 4a) linking quantitative estimates of Selection, Dispersal Limitation and Drift to the measured andunmeasured abiotic factors that impose these processes. Our analyses provide a fundamentally deeper understanding of ecological communities and provide inferences that are qualitatively distinct from those derived through traditional analyses (Figure 4).

For a direct comparison with our analyses, we employed an approach similar to that used in previous work. This approach identified PCA axes that are significantly related to Bray–Curtis, but provides no means to determine the processes imposed by significant variables. Previous studies that use approaches similar to this ‘preexisting approach’ appear to assign processes to significant variables (for example, Tuomisto et al., 2003; Cottenie, 2005; Legendre et al., 2009; Heino et al., 2011). Doing so requires one to decide a priori which variables are associated with which ecological process. Identifying the features of a system that impose Selection and those that impose Dispersal Limitation is an empirical question, however, that requires an answer informed by ecological patterns of a given system. Further, preexisting approaches cannot estimate the relative influences of ecological processes or identify unmeasured environmental variables. All these limitations would remain if other preexisting approaches were used, such as using redundancy analysis on raw community composition data (for example, Legendre et al., 2009).

Although we suggest that our framework provides novel insights, it is important to recognize that there are limitations and, as with any new approach, these limitations can be vetted through additional use and simulation-based studies. One limitation, for example, is that the current framework does not parse out sub-classes of Selection, such as competition and trophic interactions. In addition, the framework could be sensitive to factors such as phylogenetic uncertainty and alpha diversity underestimation. These particular factors are partially controlled by confirming phylogenetic signal upfront and using null models that hold observed alpha diversity constant, respectively. Simulation studies are nonetheless needed for a full evaluation.

More generally, the knowledge we seek builds from a revolution in ecological thought that has largely taken place across the last decade. Although often rebuked and rejected (for example, Ricklefs and Renner, 2012, Hubbell’s (2001) neutral theory encouraged broader recognition of Drift and Dispersal. As a consequence, it is now broadly recognized that Selection works alongside Drift and Dispersal (Cottenie, 2005; Gravel et al., 2006; Adler et al., 2007; Legendre et al., 2009; Dumbrell et al., 2010; Chase and Myers, 2011). This is the conceptual foundation from which we work and out of which a unification of community ecology can emerge (Vellend, 2010).

References

Adler PB, HilleRisLambers J, Levine JM . (2007). A niche for neutrality. Ecol Lett 10: 95–104.
Article Google Scholar
Anderson MJ, Crist TO, Chase JM, Vellend M, Inouye BD, Freestone AL et al (2011). Navigating the multiple meanings of β diversity: a roadmap for the practicing ecologist. Ecol Lett 14: 19–28.
Article Google Scholar
Andersson AF, Riemann L, Bertilsson S . (2010). Pyrosequencing reveals contrasting seasonal dynamics of taxa within Baltic Sea bacterioplankton communities. ISME J 4: 171–181.
Article Google Scholar
Bjornstad BN, Horner JA, Vermeul VR, Lanigan DC, Thorne PD . (2009) Borehole Completion and Conceptual Hydrogeologic Model for the IFRC Well Field, 300 Area, Hanford Site. PNNL-18340. Pacific Northwest National Laboratory: Richland, WA.
Book Google Scholar
Blanchet FG, Legendre P, Borcard D . (2008). Forward selection of explanatory variables. Ecology 89: 2623–2632.
Article Google Scholar
Borcard D, Legendre P . (2002). All-scale spatial analysis of ecological data by means of principal coordinates of neighbour matrices. Ecol Model 153: 51–68.
Article Google Scholar
Borcard D, Gillet F, Legendre L . (2011) Numerical Ecology with R. Springer: New York, NY.
Google Scholar
Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK et al (2010). QIIME allows analysis of high-throughput community sequencing data. Nat Methods 7: 335–336.
Article CAS Google Scholar
Cavender-Bares J, Kozak KH, Fine PVA, Kembel SW . (2009). The merging of community ecology and phylogenetic biology. Ecol Lett 12: 693–715.
Article Google Scholar
Chase JM . (2010). Stochastic community assembly causes higher biodiversity in more productive environments. Science 328: 1388–1391.
Article CAS Google Scholar
Chase JM, Kraft NJB, Smith KG, Vellend M, Inouye BD . (2011). Using null models to disentangle variation in community dissimilarity from variation in α-diversity. Ecosphere 2: art24.
Article Google Scholar
Chase JM, Myers JA . (2011). Disentangling the importance of ecological niches from stochastic processes across scales. Philos Transact Royal Soc B Biol Sci 366: 2351–2363.
Article Google Scholar
Cottenie K . (2005). Integrating environmental and spatial processes in ecological community dynamics. Ecol Lett 8: 1175–1182.
Article Google Scholar
de Wit R, Bouvier T . (2006). ‘Everything is everywhere, but, the environment selects’; what did Baas Becking and Beijerinck really say? Environ Microbiol 8: 755–758.
Article Google Scholar
Diniz-Filho JAF, Terribile LC, da Cruz MJR, Vieira LCG . (2010). Hidden patterns of phylogenetic non-stationarity overwhelm comparative analyses of niche conservatism and divergence. Global Ecol Biogeogr 19: 916–926.
Article Google Scholar
Dumbrell AJ, Nelson M, Helgason T, Dytham C, Fitter AH . (2010). Relative roles of niche and neutral processes in structuring a soil microbial community. ISME J 4: 337–345.
Article Google Scholar
Fine PVA, Kembel SW . (2011). Phylogenetic community structure and phylogenetic turnover across space and edaphic gradients in western Amazonian tree communities. Ecography 34: 552–565.
Article Google Scholar
Gilbert B, Bennett JR . (2010). Partitioning variation in ecological communities: do the numbers add up? J Appl Ecol 47: 1071–1082.
Article Google Scholar
Gilbert JA, Steele JA, Caporaso JG, Steinbruck L, Reeder J, Temperton B et al (2012). Defining seasonal marine microbial community dynamics. ISME J 6: 298–308.
Article CAS Google Scholar
Graham CH, Fine PVA . (2008). Phylogenetic beta diversity: linking ecological and evolutionary processes across space in time. Ecol Lett 11: 1265–1277.
Article Google Scholar
Gravel D, Canham CD, Beaudet M, Messier C . (2006). Reconciling niche and neutrality: the continuum hypothesis. Ecol Lett 9: 399–409.
Article Google Scholar
Hardy OJ . (2008). Testing the spatial phylogenetic structure of local communities: statistical performances of different null models and test statistics on a locally neutral community. J Ecol 96: 914–926.
Article Google Scholar
Hardy OJ, Couteron P, Munoz F, Ramesh BR, Pélissier R . (2012). Phylogenetic turnover in tropical tree communities: impact of environmental filtering, biogeography and mesoclimatic niche conservatism. Global Ecol Biogeogr 21: 1007–1016.
Article Google Scholar
Heino J, Grönroos M, Soininen J, Virtanen R, Muotka T . (2011). Context dependency and metacommunity structuring in boreal headwater streams. Oikos 121: 537–544.
Article Google Scholar
Hubbell SP . (2001) The Unified Neutral Theory of Biodiversity and Biogeography. Princeton University Press: Princeton, NJ.
Google Scholar
Jacobson B, Peres-Neto PR . (2010). Quantifying and disentangling dispersal in metacommunities: how close have we come? How far is there to go? Landscape Ecol 25: 495–507.
Article Google Scholar
Kraft NJB, Cornwell WK, Webb CO, Ackerly DD . (2007). Trait evolution, community assembly, and the phylogenetic structure of ecological communities. Am Nat 170: 271–283.
Article Google Scholar
Kraft NJB, Comita LS, Chase JM, Sanders NJ, Swenson NG, Crist TO et al (2011). Disentangling the drivers of β diversity along latitudinal and elevational gradients. Science 333: 1755–1758.
Article CAS Google Scholar
Legendre P, Anderson MJ . (1999). Distance-based redundancy analysis: testing multispecies responses in multifactorial ecological experiments. Ecol Monogr 69: 1–24.
Article Google Scholar
Legendre P, Mi XC, Ren HB, Ma KP, Yu MJ, Sun IF et al (2009). Partitioning beta diversity in a subtropical broad-leaved forest of China. Ecology 90: 663–674.
Article Google Scholar
Leibold MA, Holyoak M, Mouquet N, Amarasekare P, Chase JM, Hoopes MF et al (2004). The metacommunity concept: a framework for multi-scale community ecology. Ecol Lett 7: 601–613.
Article Google Scholar
Lin X, Kennedy D, Peacock A, McKinley J, Resch CT, Fredrickson J et al (2012a). Distribution of microbial biomass and potential for anaerobic respiration in Hanford Site 300 Area subsurface sediment. Appl Environ Microb 78: 759–767.
Article CAS Google Scholar
Lin X, Mckinley J, Resch CT, Lauber C, Fredrickson J, Konopka AE . (2012b). Spatial and temporal dynamics of microbial community in the Hanford unconfined aquifer. ISME J 6: 1665–1676.
Article CAS Google Scholar
Losos JB . (2008). Phylogenetic niche conservatism, phylogenetic signal and the relationship between phylogenetic relatedness and ecological similarity among species. Ecol Lett 11: 995–1003.
Article Google Scholar
Martiny JBH, Bohannan BJM, Brown JH, Colwell RK, Fuhrman JA, Green JL et al (2006). Microbial biogeography: putting microorganisms on the map. Nat Rev Micro 4: 102–112.
Article CAS Google Scholar
Martiny JBH, Eisen JA, Penn K, Allison SD, Horner-Devine MC . (2011). Drivers of bacterial β-diversity depend on spatial scale. Proc Natl Acad Sci USA 108: 7850–7854.
Article CAS Google Scholar
Mouquet N, Loreau M . (2003). Community patterns in source-sink metacommunities. Am Nat 162: 544–557.
Article Google Scholar
Ofiteru ID, Lunn M, Curtis TP, Wells GF, Criddle CS, Francis CA et al (2010). Combined niche and neutral effects in a microbial wastewater treatment community. Proc Natl Acad Sci USA 107: 15345–15350.
Article CAS Google Scholar
Peterson RE, Rockhold ML, Serne RJ, Thorne PD, Williams MD . (2008) Uranium Contamination in the Subsurface Beneath the 300 Area, Hanford site, Washington. PNNL-17034. Pacific Northwest National Laboratory: Richland, WA.
Book Google Scholar
R-Core-Team (2012) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing: Vienna, Austria.
Ricklefs RE, Renner SS . (2012). Global correlations in tropical tree species richness and abundance reject neutrality. Science 335: 464–467.
Article CAS Google Scholar
Smith TW, Lundholm JT . (2010). Variation partitioning as a tool to distinguish between niche and neutral processes. Ecography 33: 648–655.
Article Google Scholar
Stegen JC, Hurlbert AH . (2011). Inferring ecological processes from taxonomic, phylogenetic and functional trait β-diversity. PLoS One 6: e20906.
Article CAS Google Scholar
Stegen JC, Lin X, Konopka AE, Fredrickson JK . (2012). Stochastic and deterministic assembly processes in subsurface microbial communities. ISME J 6: 1653–1664.
Article CAS Google Scholar
Tuomisto H, Ruokolainen K, Yli-Halla M . (2003). Dispersal, environment, and floristic variation of western amazonian forests. Science 299: 241–244.
Article CAS Google Scholar
Vellend M . (2010). Conceptual synthesis in community ecology. Q Rev Biol 85: 183–206.
Article Google Scholar
Webb CO, Ackerly DD, Kembel S . (2011). Phylocom: software for the analysis of phylogenetic community structure and character evolution (with phylomatic and ecoevolve). User’s Manual version 4.2. http://www.phylodiversity.net/phylocom/.
Whittaker RH . (1967). Gradient analysis of vegetation. Biol Rev 42: 207–264.
Article CAS Google Scholar

Download references

Acknowledgements

JCS was supported by a Linus Pauling Distinguished Postdoctoral Fellowship at Pacific Northwest National Laboratory. We thank AH Hurlbert, NJB Kraft and M Vellend for their helpful discussions related to this work. This research was supported by the US Department of Energy (DOE), Office of Biological and Environmental Research (BER), as part of Subsurface Biogeochemistry Research Program’s Scientific Focus Area (SFA) and Integrated Field-Scale Research Challenge (IFRC) at the Pacific Northwest National Laboratory (PNNL). PNNL is operated for DOE by Battelle under contract DE-AC06-76RLO 1830.

Author information

Authors and Affiliations

Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
James C Stegen, Xueju Lin, Jim K Fredrickson, David W Kennedy & Allan Konopka
School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
Xueju Lin
Hydrology Group, Pacific Northwest National Laboratory, Richland, WA, USA
Xingyuan Chen & Mark L Rockhold
Department of Geosciences, Pacific Northwest National Laboratory, Richland, WA, USA
Christopher J Murray

Authors

James C Stegen
View author publications
You can also search for this author in PubMed Google Scholar
Xueju Lin
View author publications
You can also search for this author in PubMed Google Scholar
Jim K Fredrickson
View author publications
You can also search for this author in PubMed Google Scholar
Xingyuan Chen
View author publications
You can also search for this author in PubMed Google Scholar
David W Kennedy
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J Murray
View author publications
You can also search for this author in PubMed Google Scholar
Mark L Rockhold
View author publications
You can also search for this author in PubMed Google Scholar
Allan Konopka
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to James C Stegen.

Ethics declarations

Competing interests

The authors declare no conflict of interest.

Additional information

Supplementary Information accompanies this paper on The ISME Journal website

Supplementary information

Supplementary Information (DOC 3154 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Stegen, J., Lin, X., Fredrickson, J. et al. Quantifying community assembly processes and identifying features that impose them. ISME J 7, 2069–2079 (2013). https://doi.org/10.1038/ismej.2013.93

Download citation

Received: 26 September 2012
Revised: 01 February 2013
Accepted: 11 February 2013
Published: 06 June 2013
Issue Date: November 2013
DOI: https://doi.org/10.1038/ismej.2013.93

Keywords

This article is cited by

Taxonomic dependency and spatial heterogeneity in assembly mechanisms of bacteria across complex coastal waters
- Huizhen Yan
- Dandan Lin
- Kai Wang
Ecological Processes (2024)
Bacterial wilt affects the structure and assembly of microbial communities along the soil-root continuum
- Jinchang Liang
- Chengjian Wei
- Xiaoqiang Wang
Environmental Microbiome (2024)
Environmental stress mediates groundwater microbial community assembly
- Daliang Ning
- Yajiao Wang
- Jizhong Zhou
Nature Microbiology (2024)
Effects of plant tissue permeability on invasion and population bottlenecks of a phytopathogen
- Gaofei Jiang
- Yuling Zhang
- Zhong Wei
Nature Communications (2024)
Changes in soil microbial community and co-occurrence network after long-term no-tillage and mulching in dryland farming
- Zijun Dai
- Jun Fan
- Mingde Hao
Plant and Soil (2024)