Introduction

Understanding how patterns of diversity are established and maintained is fundamental to the ecological characterisation of living systems. Following Whittaker (1972, 1960), diversity is traditionally considered to comprise of three components: local diversity of individual habitats (α diversity) and between-site variation (β diversity) together determine the total diversity of a given system (γ diversity). However, while referring to these definitions, researchers have studied conceptually different phenomena under the umbrella term ‘diversity’. β diversity, in particular, has been variously reported as species turnover or variation, further sub-defined and quantified using different mathematical approaches (Anderson et al., 2010; Tuomisto, 2010). Nevertheless, most authors agree that community similarity, the compositional variation between sites, is an integral aspect of β diversity, and more generally one of the most important parameters in community ecology (Vellend, 2010). To characterise the mechanisms underlying an observed diversity structure, it is essential to quantify and appraise patterns of community similarity.

A multitude of mathematical indices of community similarity have been proposed: as of 2016, the widely used software EstimateS (Colwell, 2013) computes 12 different indices, while the popular microbial ecology toolboxes mothur (Schloss et al., 2009) and phyloseq (McMurdie and Holmes, 2013) provide as many as 37 and 46 measures, respectively. The various available measures capture conceptually different aspects of diversity. Traditional measures, such as the Jaccard (1901) or Bray–Curtis (1957) indices, focus on taxa compositional overlap, quantified directly from taxa count data. More recently, phylogenetically informed indices have become increasingly popular, which, in contrast to census-based metrics, do not treat taxa independently but rather quantify shared evolutionary history between communities (Graham and Fine, 2008; Swenson, 2011). Traditional and phylogenetic metrics may provide complementary insights into the processes driving community composition, particularly since phylogenetic relatedness of taxa is considered a proxy for functional or ecological similarity (Webb et al., 2002).

Apart from analysing diversity patterns, another important approach to characterising ecosystem function focuses on studying interaction networks of ecological or functional associations between taxa directly. Applying graph theory to food webs, mutualist or host-parasite networks and others has revealed an important role for interaction structure in community stability and dynamics (Polis and Strong, 1996; Proulx et al., 2005; Ings et al., 2009). This approach has been particularly fruitful in microbial ecology, where ‘true’ ecological interactions can to a certain extent be inferred from co-occurrence networks of anonymous Operational Taxonomic Units (OTUs; Faust and Raes, 2012; Berry and Widder, 2014). Highly informative taxa co-occurrence networks have been constructed for many ecological systems, including the human body-associated microbiota (Faust et al., 2012) or ocean planktonic communities (Lima-Mendez et al., 2015; Sunagawa et al., 2015), as well as for global, integrated datasets across various habitats (Chaffron et al., 2010).

One main difference between such diversity-based and network-based approaches lies in analysis scope: the latter identify drivers of community structure at the level of individual taxa interactions, while the former reveal compositional patterns at community level. Arguably, both approaches are informative, but they are often pursued independently: it remains challenging to interpret community-level diversity changes in light of taxa-level ecological associations, and vice versa. In this study, we propose to bridge this analysis gap with a set of mathematical indices that quantify community similarity (or β diversity) as the average taxa interaction strength between samples. While our method is applicable to many types of interaction data, we focus on Taxa INteraction-Adjusted indices (TINA), based on taxa co-occurrence data, and Phylogenetic INteraction-Adjusted indices (PINA), based on phylogenetic similarities. In a re-analysis of two publicly available datasets, we show that TINA and PINA capture known diversity patterns better than existing indices, even for very small datasets, and that they can reveal novel and refined biological interpretations.

Materials and methods

In this study, we compared a total of 11 indices of community similarity (listed in Table 1) that fall into three categories: ‘traditional’ taxa count-based indices, phylogenetic indices and our proposed interaction-adjusted indices (Figure 1).

Table 1 Overview of different indices of community similarity used in this study
Figure 1
figure 1

Overview of different approaches to quantifying community similarity. Based on a taxa-sample count table, traditional count-based indices such as Jaccard and Bray–Curtis quantify community similarity from the overlap in taxa composition (upper branch). In contrast, phylogenetic indices such as UniFrac take into account taxa relationships, quantifying community similarity as shared evolutionary history, based on taxa phylogeny (middle branch). Our proposed Taxa INteraction-Adjusted (TINA) and Phylogenetic INteraction-Adjusted (PINA) indices, in contrast, take into account similarities on a taxa co-occurrence network, codified in an interaction similarity matrix C, or in terms of cophenetic phylogenetic distances, represented in a phylogenetic association matrix Φ.

Classical and phylogenetic community similarity indices

In his widely cited ‘comparative study of the floral distribution of parts of the Alps and the Jura’, Paul Jaccard (1901) introduced what is arguably the earliest index of β diversity. For two communites A and B, the classical Jaccard index (JCI) is the relative taxa overlap, that is the ratio of shared taxa among all sampled taxa (see formula in Table 1). In this original fomulation, the JCI is incidence-based, or unweighted: it considers only the presence and absence of taxa, but not their relative abundance ratios. Several abundance-based or weighted variations of the classical Jaccard index have been proposed (Chao et al., 2004); here, we use a straightforward weighted Jaccard (JCW) formulation that describes community similarity as the mean fraction of individuals in shared taxa across both focal samples. While the JCI has proved very versatile and is used for manifold scientific problems beyond biology, an ecology-specific variant that corrects for the characteristics of imperfect sampling has been proposed by Chao et al (2004): Chao’s weighted Jaccard index (JCC) extrapolates the fractions of individuals in unseen shared taxa based on the number of observed shared rare taxa.

One of the most widely used indices in modern community ecology is arguably the Bray-Curtis similarity (BC; Bray and Curtis, 1957), which describes community overlap as the fractional minimum abundance of shared taxa between samples. Somewhat related to BC is the Morisita-Horn overlap index (MH; Horn, 1966), calculated as pairwise multiplicative taxa overlap, adjusted by a per-sample concentration index.

These classical indices and their derivations assess community overlap directly from count data, treating all observed taxa as independent (Figure 1, top branch). Phylogenetic indices, in contrast, consider the (phylogenetic) relationships between taxa and quantify community similarity as shared evolutionary history (Figure 1, middle). Among these increasingly popular indices is UniFrac, which calculates the shared branch length between samples on a phylogenetic tree, either for observed taxa based on incidence (unweighted UniFrac, UFU; Lozupone and Knight, 2005), or based on taxa abundances (weighted UniFrac, UFW; Lozupone et al., 2007).

A novel family of interaction-adjusted community similarity indices

Consider two communities A and B, composed of NA and NB taxa from which a total of nA and nB individuals have been sampled. Next, consider a matrix I that describes pairwise taxa interactions, such that Iij is the interaction between taxa i and j. Manifold types of interactions with different biological meanings, different layers of information and at different levels of curation effort are suitable, such as for example predator–prey relationships, symbiosis, parasitism, mutualism, cross-feeding, resource competition and so on. Here, we consider the case of ecological associations as inferred by taxa co-occurrence networks, constructed from taxa count tables by pairwise association of abundances across samples (Figure 1, bottom). The scale and characteristics of such a co-occurrence interaction matrix IC will depend on the association metric chosen; Faust and Raes (2012) have provided a comprehensive review of different approaches to network construction and interpretation. For example, a taxa abundance correlation network would scale from −1 (avoidance) to +1 (complete association), while other popular association metrics may scale differently. Therefore, it is important to transform the interaction matrix IC to a common scale; we do this by correlating taxa by their pairwise associations to all other taxa in the system (that is, row IC,i* of IC for taxon i) and transforming this into a Pearson similarity:

Thus defined, a transformed co-occurrence matrix C has several desirable properties: (i) it scales from 0 (avoidance) to 0.5 (neutral association) to 1 (complete association); (ii) Cij corresponds to the ‘proximity’ of taxa i and j on the original association network IC; (iii) the transformation generally sharpens and smoothens network structure, amplifying strong associations at the expense of weaker correlations, but association ranks remain mostly unchanged (see Supplementary Figure S1 in Supporting Information).

Given this transformed interaction matrix, we propose to quantify the similarity between communities A and B as the average interaction strength between all taxa observed in A or B. We thus define an incidence-based or unweighted Taxa INteraction-Adjusted index of community similarity (unweighted TINA, TU) as

Likewise, an abundance-based or weighted TINA index (TW) can be defined as weighted average taxa interaction strength, scaled by the geometric mean per-sample weighted taxa interaction strength:

TINA values are 1 for two completely identical communities, but also if all taxa in A and B are perfectly associated. If no taxa are shared, TINA values tend towards 0.5 if taxa interactions are neutral (neither associative nor dissociative) and towards 0 if taxa between A and B show a strong avoidance signal. Thus, TINA resolves non-zero similarities even for pairs of communities that do not share any taxa (which implies zero similarity according to traditional, count-based indices); theoretically, the TINA index for such disparate pairs can even be 1 if all their taxa are perfectly associated.

TINA-like indices can be defined analogously for any kind of interaction data, given that interaction matrices can be transformed similarly to the IC to C transformation described above. This is also true for the special case of phylogenetic ‘interactions’. Consider a phylogenetic tree ϕ of taxa observed in a given system with a cophenetic phylogenetic similarity matrix Iϕ, which can be interpreted as a phylogenetic association network (analogous to IC) and transformed into an association matrix Φ (analogous to the co-occurrence association matrix C). Then, we can define unweighted Phylogenetic INteraction-Adjusted community similarity (unweighted PINA, PU) as

and weighted PINA (PW) as

Human Microbiome Project data analysis

To test the performance of different community similarity indices, we re-analysed two publicly available datasets, provided by the Human Microbiome Project and the TARA Oceans project, respectively. Raw 16S rRNA V35 region sequencing reads of The Human Microbiome Project (2012) were downloaded from the NCBI Sequence Read Archive; metadata was obtained from the HMP data repository (http://hmpdacc.org). Sequences were filtered for chimeric reads using UCHIME (Edgar et al., 2011), aligned to a secondary structure-aware 16S rRNA model using Infernal (Nawrocki and Eddy, 2013), denoised by a global minimum read abundance at 1% tolerance of 4 and clustered into OTUs at 97% average linkage sequence similarity using hpc-clust (Matias Rodrigues and von Mering, 2014), as established previously (Schmidt et al., 2014, 2015). The resulting filtered taxa count table contained 24 717 447 sequences clustered into 27 041 OTUs across 3849 samples. A phylogenetic tree of OTU representatives, selected by minimum average within-OTU sequence distance, was inferred using FastTree2 (Price et al., 2010) with default parameters. Pairwise taxa co-occurence networks for the full dataset and subsets were calculated using taxa-wise Bray–Curtis dissimilarity, weighted Jaccard index, Spearman correlation and a custom R implementation of SparCC (Friedman and Alm, 2012), an adapted correlation metric correcting for spurious associations that has been shown to approximate ‘true’ ecological interactions as simulated using a Generalized Lotka–Volterra (GLV) model (Berry and Widder, 2014).

Simulation of human microbiome samples

Simulations were conducted based on (re-sampled) real count data subsets of the HMP dataset or based on entirely synthetic counts. GLV models were adapted from Berry and Widder (2014), with per-sample carrying capacities set uniformly to total sample size, initial per-sample counts set to sum to 10% of sample carrying capacity, growth rates re-sampled from a uniform distribution on [0,1] unless otherwise specified and further parameters chosen test-specifically. All simulations were repeated 20 times (see online analysis code).

TARA Oceans data processing and analysis

From the TARA Oceans eukaryotic plankton diversity census (De Vargas et al., 2015), we downloaded 18S rRNA V9 tag data organised into an OTU-level taxa count table (http://doi.pangaea.de/10.1594/PANGAEA.843022) and sample metadata (http://doi.pangaea.de/10.1594/PANGAEA.843017). Data per sample were pooled across filter sizes and OTUs containing 30 sequences as well as several orphan samples were removed (see analysis code), yielding a filtered count table of 535 903 407 sequences, 27 448 OTUs and 77 samples for which a SparCC correlation network was computed.

Data and software availability

All analysis code and processed datasets are available online (http://github.com/defleury/Schmidt_et_al_2016_community_similarity; http://meringlab.org/suppdata/2016-community_similarity/).

Results

TINA and PINA provide improved partitioning of human body site-specific microbial communities

From an ecological point of view, the human body appears as little more than a system of distinct microbial habitats (Costello et al., 2012). The Human Microbiome Project (HMP Consortium, 2012) has provided a first comprehensive census of human body-associated microbial communities and their potential functional repertoires. HMP 16S rRNA tag sequencing data are available for 18 habitats from five different body sites, namely oral cavity (9 habitats), skin (4), female urogenital tract (3), airways (1) and gastrointestinal tract (1); see Figure 2 for a full list. One original goal of the HMP, similar to many ecological studies, was to establish how compositional similarity patterns distinguish communities associated to these different habitats. In other words: are body sites distinct from each other in microbial community composition, and which other factors drive compositional variation?

Figure 2
figure 2

Differential partitioning of human body habitat-specific community structure. (a) Partitioning by general body sites. Left, PERMANOVA F statistics for different indices when testing community distance partitioning according to general body site, that is into oral, skin, urogenital, airways and gastrointestinal habitats. Right, histograms of community distances intra-site (orange) and inter-site (blue) for all body sites against each other, oral against other habitats and gastrointestinal against other habitats. The inset illustrates how PERMANOVA F statistics and R2 values are calculated from community distances using the adonis function of the R package vegan (Anderson, 2001; Oksanen et al., 2015). (b) Sub-partitioning of oral habitats (blue), skin habitats (yellow) and urogenital habitats (violet). (c) Pairwise separation by different indices of all pairs of oral habitats, skin habitats against each other and against airways (anterior nares) and urogenital habitats against each other, as detected by the PERMANOVA R2 value (relative variance explained by factor habitat). Note the different colour scales, indicating overall differential partitioning power per body site.

These types of questions are typically addressed by calculating pairwise community distances and then applying multivariate statistical tests to establish how much of the distance matrix structure is explained by a given model. One of the most widely used methods is Anderson’s PERMANOVA (permutational analysis of variance; Anderson, 2001), implemented in the adonis function of the R package vegan (Oksanen et al., 2015), which calculates a pseudo-F statistic on group separation from the sums of squares of inter-group distances over the sums of squares of intra-group distances (see Figure 2a) and then conducts a permutational significance test. Thus, the adonis F statistic, as well as the related R2 statistic (the variance explained by the tested factor) indicate an effect size of multivariate group separation (higher F and R2 values indicate more discriminatory power), while a permutational P value indicates significance. F and R2 statistics have previously been used to benchmark multivariate ecological analyses, for example by Eren et al (2015).

We re-analysed the HMP data using 11 different community similarity indices (Table 1), five of which are count-based (JCI, JCW, JCC, BC and MH), two phylogenetic (UFU, UFW) and four interaction-adjusted (TU, TW, PU and PW). We observed that partitioning of the five general body sites (oral cavity, skin, urogenital tract, airways, gut) by community similarity was by far best for TINA (based on a SparCC correlation network; F=30 455 for TW; F=16 421 for TU) and PINA (PW, 4397; PU, 6917) when compared to all other indices, with JCI providing the weakest discrimination (F=182). This improved partitioning was due to several effects, as indicated by community distance histograms per index (Figure 2a). First, TINA and PINA provided very high overall resolution, distributing pairwise dissimilarities across a broader range on the interval 0 (identical communities) to 1 (complete dissimilarity) than most count-based indices. Second, TINA and PINA assigned very low and sharply distributed intra-group dissimilarities, meaning that samples from the same body sites were on average considered very similar to each other; in contrast, count-based indices showed very sharp and pronounced inter-group dissimilarities, but wider distributions within groups. Finally, intra- and inter-group dissimilarities were generally much clearer separated for TINA than for count-based indices or UniFrac.

To assess the robustness of TINA performance towards the choice of network inference method, we re-tested body site separation based on re-calculated taxa co-occurrences using four different approaches: (i) taxa-wise Bray–Curtis dissimilarity (TW-BC); (ii) taxa-wise weighted Jaccard distance (TW-JCW); (iii) Spearman rank correlation of taxa across samples (TW-SP); and (iv) SparCC correlation (Friedman and Alm, 2012). We found that network inference method had a strong effect on both unweighted and weighted TINA in terms of F and R2 values, but that discriminative power was consistently high. SparCC-based TINA outperformed TW-SP, TW-JCW and TW-BC (in this order), and all TINA metrics outperformed UniFrac and count-based indices (see Supplementary Figure S2). Since by far the strongest separation signal was observed for TINA based on SparCC, we relied on SparCC for network inference in the subsequent tests reported below.

Combined interpretation of TINA and PINA may provide novel biological insights

While habitat partitioning was differentially pronounced, all indices provided significant group separation (P0.001, 999 permutations). Indeed, differences in community composition between body sites – which are highly distinct micro-environments – can be expected to be large, so it is not surprising that they were picked up by all indices. We therefore conducted similar tests on more complicated problems, such as the separation of habitats within a body site (Figure 2b) or of pairs of similar habitats (Figure 2c). TINA provided by far the strongest partitioning of oral and skin habitats, followed by PINA and JCW/JCC. For urogenital sites, in contrast, only few indices provided significant separation at all: unweighted PINA (PU), UFU, UFW and JCI. These trends were consistent with pairwise separability of habitats (Figure 2c), which was highest for TINA in oral and skin, but for PU, UFU and UFW in urogenital sites.

These observations imply that diversity patterns in these habitats are determined by different factors. TINA quantifies community similarity as an overlap in ecological associations of taxa, while PINA and UniFrac focus on shared phylogeny. Thus, it appears that the compositional identity of oral and skin sites is driven by recurring cliques of associated OTUs, as captured by strong co-occurrence signals, while pairwise taxa associations are less important in the urogenital tract, where communities of changing partners are instead filtered by phylogeny, possibly indicating a functional signal. Indeed, we detected a small but significant phylogenetic signal for the distribution of pooled OTU abundances across urogenital sites (Supplementary Figure S3; vaginal introitus: K=0.051, P0.001; mid-vagina: K=0.023, P0.001; posterior fornix: K=0.023, P0.001). However, body subsite was indeed not the predominant determinant of per-sample community composition (Supplementary Figure S4): rather, variation between subjects was higher than within subjects (P0.02 for all tested metrics except, interestingly, unweighted PINA), while even this factor accounted for a maximum of only 4% of variation (unweighted TINA). This is in line with observations in the original HMP study, which moreover established strong associations to other factors such as pH or BMI (HMP Consortium, 2012). Thus, a combined interpretation of TINA and PINA may guide biological interpretation: between-subject community composition appears to be (moderately) determined by taxa co-occurrence, whereas a weak but significant phylogenetic signal may shape differences between vaginal subsites.

TINA captures habitat structure of the human microbiome taxa co-occurrence network

To illustrate how TINA captures ecological taxa interaction structure, how TINA values can be interpreted at the level of individual sample pairs and under which conditions it provides more intuitive results than count-based indices, we selected 16 HMP samples for which Figure 3 shows the taxa co-occurrence network; Supplementary Figure S5 shows the same network, coloured by OTU phylum-level taxonomy. We observed that the network is structured into several habitat-specific clusters of strongly co-occurring OTUs, with slightly less dependence on taxonomy. This is in line with the general observation that (microbial) co-occurrence networks tend to capture ecological signals, which indeed can often be much more subtle than the present body habitat classification (Faust and Raes, 2012).

Figure 3
figure 3

Taxa co-occurrence network for a subset of Human Microbiome Project samples. Sixteen samples from the full HMP dataset were selected as described in the main text, comprising a total of 2671 OTUs for which all pairwise SparCC correlations ≥0.5 are shown as edges in the network. Node size indicates global OTU size, that is the total number of counts per OTU across the full HMP dataset. Node colour indicates OTU dominant habitat, assigned if more than 50% of all OTU abundance was in samples of the same body site. Supplementary Figure S2 shows the same network, coloured by OTU phylum-level taxonomy.

Next, we mapped three examples of sample pairs onto this network to illustrate how TINA takes interaction structure into account to quantify diversity. In the first example (Figure 4a), we consider two samples from the vaginal posterior fornix, which were the most similar pair according to the classical Jaccard index (lowest JCI distance). These samples have a high taxa overlap, while many of their non-shared taxa also share strong co-occurrence signals, so that TINA likewise assigns them a very high similarity. Thus, in this case, TINA and count-based indices agree in considering both samples highly similar.

Figure 4
figure 4

TINA quantifies community similarity from taxa co-occurrence. (a) For two urogenital samples that share a large taxa overlap, both traditional count-based indices (exemplified by the classical Jaccard index, JCI) and TINA assign a low-ranking community distance (as indicated in community distance histograms on the right). The middle panel shows how taxa of these samples map onto the co-occurrence network introduced in Figure 3; blue, taxa unique to sample SN700035995; green, taxa unique to sample SN700096698; blue-green, taxa shared between both samples. (b) For two urogenital samples that do not share any taxa, but whose taxa still share attractive co-occurrence interactions, JCI assigns complete distance (JCI=1), while TINA assigns a relatively low-ranking distance. (c) In the opposite case of two samples from different body sites (skin and oral), which have a significant taxa overlap, but repulsive taxa interactions, JCI assigns a low-ranking, but TINA a very high-ranking distance.

The second case is less trivial (Figure 4b). Here, we consider two urogenital samples from the vaginal introitus, which do not share any taxa at all – count-based indices assign a distance of 1, considering them completely dissimilar. However, the taxa found in these samples share an attractive interaction signal, even though some taxa form part of distinct co-occurence clusters. In this case, TINA provides a more intuitive result by ranking the similarity between two samples from the same habitat (vaginal introitus) relatively high.

Finally, consider the opposite situation (Figure 4c). Here, two samples from different body sites (skin and oral) happen to share several taxa, so that JCI ranks them among the top 17.3% most similar pairs. However, most non-shared taxa belong to distinct co-occurrence clusters, meaning that their pairwise interactions are repulsive, such that TINA assigns a very high dissimilarity to this pair, which, again, is an arguably more realistic picture.

Interaction-adjusted indices provide strong partitioning even for small datasets

The HMP dataset used in this study is comparatively large, comprising 27 041 OTUs across 3849 samples. To test whether the observed trends were robust to dataset scope, we conducted two downsampling experiments (Figure 5 and Supplementary Figure S6). First, we randomly selected between 5 and 50 samples per body site (25–250 samples in total), re-calculated co-occurrence and phylogenetic interaction strengths and assessed body site separation by all 11 indices, at 10 iterations per down-sampling step (Figure 5a; see Supplementary Figure S6A for the corresponding plot on the R2 statistic). We found that even for the smallest tested dataset, TINA and PINA indices provided much clearer partitioning by body site, although ranks by partitioning effect sizes varied across down-sampling iterations. Next, we randomly selected 1000 samples from which we drew 1000 sequences each and down-sampled these to 50 sequences per sample in several steps, at 10 iterations per step, recalculated co-occurrence and phylogenetic interactions and quantified community similarity (Figure 5b and Supplementary Figure S6B). Likewise, we observed that TINA and PINA provided much better separation by body site than all other indices, even at a drastically small size of 50 sequences per sample.

Figure 5
figure 5

TINA and PINA detect strong partitioning by body habitat even for very small datasets. Adonis F statistic for separation by body site according to all 11 tested indices, for datasets randomly down-sampled following two different regimes. Taxa co-occurrences and phylogenetic interactions were re-calculated for every down-sampled dataset. (a) 5, 10, 20 and 50 samples per body site were randomly selected from the full HMP dataset, at 10 iterations per down-sampling step. (b) 1000 randomly selected samples were down-sampled to a depth of 1000 sequences per sample; this dataset was then further down-sampled to 50 sequences per sample in several steps, at 10 random iterations per step. Supplementary Figure S3 shows corresponding plots on R2 values.

TINA captures both habitat preference and taxa interaction signals

To further investigate the behaviour of TINA in different scenarios, we conducted a series of simulations (Figure 6; raw simulation results available in Supplementary Table S1). We randomly selected 25 oral and 25 gastrointestinal samples from the HMP dataset, from which we sampled 200 of the 1000 most abundant OTUs. For this reduced dataset, we inferred Dirichlet multinomial mixture models using the software microbedmm (Holmes et al., 2012); the DMM models perfectly re-discovered the two habitat groups. Next, we iteratively simulated count data by multinomially sampling from these models and re-calculated habitat separation for different community similarity indices, conducting both across-habitat comparisons and respective within-habitat tests as negative controls (Figure 6a). We observed that all tested indices had high discriminative power, but that F values for TINA were up to three orders of magnitude larger than for count-based indices. This is in line with the expectation that DMM inference accounts for both habitat preference and (implicitly) taxa co-abundance signals, both contributing to the taxa co-occurrence signal underlying TINA.

Figure 6
figure 6

TINA captures both habitat preference and taxa interaction signals. (a) Separation of oral and gut samples, as simulated based on a Dirichlet Multinomial Mixture model inferred from raw count data. An example Principal Component Analysis (PCA) of simulation results (left) and partitioning performance for different indices when testing between-habitat and within-habitat separations (negative controls; right). (be) GLV simulations of count data. For each setup, an example interaction matrix, habitat preferences as encoded in a starting count table (oral and gut samples coloured blue and orange) and simulation outcome (as PCA ordination) are shown. The setups correspond to a negative control setting with no habitat preferences and neutral interactions (b), neutral community dynamics with habitat preferences only (c), habitat-naïve conditions with non-neutral interaction structure (d) and a combination of habitat preferences and habitat-specific interactions (e). In (de), light and dark red groups correspond to TINA calculated directly from the (known) interaction matrix. Three high outlying F values were removed in (d). Raw habitat partitioning data is available in Supplementary Table S1.

Thus, to disentangle the contributions of habitat preference and taxa interactions, we simulated count data using GLV models with different starting conditions and parameters. In a first setup, we shuffled raw OTU abundances across samples, regardless of habitat group (oral vs gut), thereby breaking any OTU habitat preference and OTU co-abundance signals. Based on these starting conditions, we simulated community dynamics in an interaction-neutral GLV model. This corresponded to a negative control setup: in the absence of any signal of habitat preference, taxa co-abundance or taxa interaction, none of the tested community similarity indices detected any separation between groups (Figure 6b). Next, we tested neutral community dynamics: we simulated data with habitat preferences, but with neutral taxa interactions by shuffling OTU abundances within habitat groups (thereby breaking co-abundance signals) as input for an interaction-neutral GLV model. In this setup, PERMANOVA F values for TINA were roughly three orders of magnitude stronger than for other indices (Figure 6c), which is in line with the expectation that habitat preferences alone can generate strong taxa co-occurrence and avoidance signals.

In a third GLV setup, we tested the opposite scenario: absence of habitat preferences a priori, but non-neutral and habitat-specific taxa interactions. For this, we split the OTU set into two groups and generated a taxa interaction matrix as follows (see Figure 6d): within-group interactions were derived from a scale-free Barabási network (Barabási and Albert, 1999) with interaction strengths and signs randomly assigned from a uniform distribution; between-group interactions were set to competitive, with uniform noise. Based on this interaction matrix, we performed GLV simulations with starting OTU counts randomly sampled from a Poisson distribution (λ=5) and invariant growth rates. From the resulting count tables, we assigned samples to mock ‘habitat groups’ based on summed OTU group abundances. We observed that partitioning power was very high for all tested indices, with TINA outperforming count-based measures. Interestingly, when we computed TINA directly based on the (known) interaction matrix, it provided even higher F values than SparCC-based TINA, derived from a more indirect count-based co-occurrence signal. It is notable that interaction structure alone, in the absence of a priori habitat preferences, was sufficient to generate clearly discernible sample clusters in our simulations, although there were comparatively large levels of noise observable in partitioning power.

Finally, we investigated a setup with both habitat preferences and non-neutral interactions (Figure 6e). We assigned habitat preferences to OTUs based on their predominant occurrence in either oral or gut samples and generated an interaction matrix as described above with within-habitat structure. We then performed GLV simulations using these interactions and habitat-specific starting counts (assigned from within-group shuffling of raw OTU abundances). We observed that TINA had much higher partitioning power than other indices, and that TINA based directly on the interaction matrix outperformed SparCC-based TINA.

Biogeographical and physicochemical gradients structuring oceanic micro-eukaryote plankton communities are best captured by TINA

The TARA Oceans project has provided a very rich and multifaceted census of the world’s oceans along several geographical and physicochemical gradients. We re-analysed TARA data on micro-eukaryote plankton diversity in the sunlit ocean (De Vargas et al., 2015) to test the performance and versatility of TINA and other indices at representing different ecological signals. The dataset contained 77 samples from 43 stations along a wide geographical gradient (as shown in Figure 7a), taken at two depth levels, subsurface water (SUR) and deep chlorophyll maximum (DCM), and with body size filters ranging from 0.8 to 2000 μm. We calculated pairwise community distances according to JCI, JCW, JCC, BC, MH, TU and TW.

Figure 7
figure 7

TINA captures biogeographical and physicochemical trends in ocean planktonic community composition. (a) TARA Oceans sampling locations, coloured by assigned oceanic world region (middle). The first axes of PCoA ordinations on TINA (left) and Bray–Curtis (right) dissimilarities correlated differentially well with latitude both for subsurface (SUR, orange) and deep chlorophyll maximum (DCM, blue) samples, both for the northern and southern hemisphere. (b) Spearman correlations of six different community distances against absolute difference in latitude (Δlat in rad). (c) PCoA ordinations and PERMANOVA statistics on dataset partitioning by factors region, depth and the region*depth interaction term for JCI, BC and TU.

To test whether community similarity follows a latitudinal gradient, we correlated the first component of a Principal Coordinates ordination (PCoA) to latitude, separately for the northern and southern hemispheres (Figure 7a). We observed that the dominating component of an unweighted TINA-based PCoA correlated well with latitude, both for SUR (ρSpearman=0.75) and DCM (ρ=0.8) water layers. In contrast, an ordination based on BC (the index used in the original study) showed much weaker correlations for the dominating component (ρ=0.15 and ρ=35, respectively). We tested for these effects more systematically by correlating pairwise community distances to absolute differences in latitude between samples (Figure 7b). While all indices provided positive correlations, TU and JCI showed the strongest trend for SUR samples (ρSpearman=0.37), while TU and TW showed the strongest signals for DCM (ρ=0.36 and ρ=0.32). Similar trends were observed for correlations of community distance to geographical distance (Supplementary Table S2). Thus, although JCI performed surprisingly well, TINA outperformed other indices at detecting a biogeographical signal. This can probably be rationalised when considering that taxa co-occurrence is expected to be in part determined by geography, which in turn also correlates with many other ecological parameters like water temperature, irradiation, salinity and so on.

Finally, we tested how well pairwise community distances represented the factors sampling region and depth; Figure 7c shows PCoA ordinations and PERMANOVA statistics for JCI, BC and TU. While all three indices provided significant separation by sampling region, water layer effects were significant only in JCI and TU, while only TU detected any effect of the region*depth interaction term. Moreover, effect sizes as expressed in F statistics and total variance explained by these three factors (summed R2) were considerably higher for TU than for JCI and BC, indicating a much more pronounced partitioning according to these terms.

Discussion

The question of what processes are rendering communities similar or dissimilar to each other is fundamental to the study of diversity. Traditionally, community similarity has been quantified from compositional overlap, considering taxa as independent of each other and describing community structure based on census data. More recently, measures based on additional signals have become increasingly popular, most prominently phylogenetic indices based on shared evolutionary history. Such approaches take into account relationships between taxa, under the assumption that phylogenetic relatedness implies ecological and functional similarity. In this study, we have introduced a set of indices that follow a different rationale: we propose to quantify community similarity in terms of interactions between taxa.

There are several arguments for doing so. First, taxa interactions are a fundamental ecological parameter, at the heart of community ecology: they are important drivers of community assembly, composition and dynamics. Second, it is therefore meaningful to characterise taxa and their relationships based on their interactions: two taxa that are highly similar in terms of their interactions with other taxa can be considered ecologically almost equivalent, they can be assumed to fulfill similar roles in a community. Our approach captures this signal: we argue that it is meaningful to consider communities as similar, which contain ecologically similar taxa. Third, interaction network analysis has proved to be a very powerful tool in unravelling complex community dynamics, but its findings are often difficult to connect to the level of community-level diversity patterns. Our approach may help to bridge this analysis gap, by providing diversity indices that are based on networks and can be interpreted in a network framework. For example, TINA essentially quantifies community distance as the distance on a taxa co-occurrence network. Finally, our approach is versatile: there are many types of ecological interactions, at many different levels, and in principle, interaction-adjusted indices like TINA and PINA can be formulated for all of them.

Several previous studies have investigated adapted versions of classical ecological diversity indices that take into account taxa dissimilarities. Most prominently, the quadratic entropy introduced by Rao (1982) calculates a concentration index, corrected for (known) dissimilarities between taxa; it is defined as the expected dissimilarity between randomly sampled individuals from a community and defaults to the classical Gini-Simpson index if all dissimilarities are zero. Ricotta and Marignani (2007), among others, suggested decompositions of Rao’s quadratic entropy into β-diversity components in the classical ecology sense, that is interpreting it as the expected dissimilarity between individuals randomly sampled from different communities. Such approaches are conceptually and mathematically distinct from our proposed interaction-adjusted indices – the latter are formulated and interpreted as average (dis)similarities on networks, rather than entropy decompositions. Likewise, our approach differs conceptually from established network clustering methods, such as for example Markov Clustering (Enright et al., 2002), which have been successfully applied to many biological problems. These exploit network structure to cluster nodes (that is, in our case, taxa) by similarity, but do not compare pre-defined samples (that is, groups of nodes) in terms of similarity on the network. Similarly, established metrics such as the Topological Overlap Measure (Li and Horvath, 2009) or Anet (Karpinets et al., 2012) compute node similarities (not sample similarities) based on relative node positions in a network; these are analogous to the more basic correlative transformation of the interaction matrix IC to matrix C discussed above (Supplementary Figure S2).

In this study, we have focused on two specific types of interaction-adjusted indices, TINA and PINA, and benchmarked their performance in a re-analysis of two large and complex datasets. TINA by far outperformed all other indices at discriminating human body habitat-specific communities (Figure 2), even for very small datasets (Figure 5). We demonstrated how co-occurrence-based TINA captures a habitat preference signal, a taxa interaction signal and a combination of both in simulations (Figure 6). Moreover, TINA best captured biogeographical trends and partitioning by the factors ‘region’ and ‘water depth’ for microeukaryotic plankton communities (Figure 7). These results can be interpreted in light of how TINA is computed. Taxa co-occurrence networks, on which TINA is based, capture an ‘integrated’ ecological association signal, in quantifying the observable outcome of the interplay of different levels of taxa interactions as patterns of co-abundance and avoidance (Faust and Raes, 2012). Although taxa association networks are not equivalent to ‘true’, ecological taxa interactions, it has recently been shown that they may provide good approximations of true interactions within certain limits (Berry and Widder, 2014). Indeed, they can reveal network structures that are specific to a given type of habitat (as shown for example in Figure 3) or structured according to their response to an ecological gradient or perturbance. Figure 4 illustrates anecdotally how TINA can capture such signals to assign community similarities that are more in line with ecological expectations than count-based indices. The fact that TINA provides such strong separations of body habitats and strong correlations with biogeography and other factors for plankton communities means that these factors have a strong and specific influence on taxa co-occurrence; subsequent diversity analyses should take this into account and interpret accordingly.

Likewise, PINA and other phylogenetic indices such as UniFrac are based on phylogenetic relatedness, operating under the assumption that shared phylogeny implies not only a shared evolutionary history, but similar ecological roles and functional profiles. However, by definition, an inference of ecological similarity from phylogeny is necessarily indirect and imperfect, although it is arguably a valid enough assumption for many phylogenetic clades. In all relevant tests, unweighted and/or weighted PINA, based on pairwise phylogenetic similarities between taxa, outperformed UniFrac, based on shared phylogeny only, even for very small datasets. Interestingly, unweighted PINA and UniFrac were the only indices to detect partitioning by urogenital habitats (Figure 2, Supplementary Figures S3 and S4), a task at which almost all count-based indices, as well as TINA, failed. As discussed above, this is likely due to differential factors and mechanisms shaping community structure in urogenital habitats, compared to other habitats. However, the fact that we did observe differential trends in index performance emphasises the importance of a multifaceted approach: by applying count-based, phylogenetic and interaction-adjusted indices, different aspects of community similarity are quantified, which can be interpreted in context of each other, with the potential to reveal biological insights beyond the scope of mono-dimensional approaches.

One possible drawback of interaction-adjusted indices is that they are not context-invariant: their values will always depend on analysis scope and on the system under consideration, as they vary with varying network structure. While count-based indices will always assume the same similarity for two communities, independently of the remaining dataset, interaction-adjusted indices may change asymptotically when more data are added, simply due to (subtle) changes in network structure. However, this behaviour can indeed also be an asset, for example when comparing multiple datasets in a meta-study. In such a setup, ‘globally’ constructed networks may mitigate dataset-specific noise, introduced for example by sampling methods or limited depth. Likewise, interaction-adjusted indices are not limited to capturing static network architectures, but their flexibility allows to account for conditionally variable networks that are rewired for example over time, gradients or in response to specific changing factors.

In the large arsenal of measures for community similarity and, more generally, β diversity, our proposed family of interaction-adjusted indices provide an important, powerful and versatile alternative. By taking taxa interactions into account, they quantify novel aspects of ‘diversity’, at the very core of community ecology, and may guide biological interpretation of diversity patterns in novel ways.