The origin of motif families in food webs

Klaise, Janis; Johnson, Samuel

doi:10.1038/s41598-017-15496-1

Download PDF

Article
Open access
Published: 23 November 2017

The origin of motif families in food webs

Scientific Reports volume 7, Article number: 16197 (2017) Cite this article

2613 Accesses
23 Citations
17 Altmetric
Metrics details

Subjects

Abstract

Food webs have been found to exhibit remarkable “motif profiles”, patterns in the relative prevalences of all possible three-species subgraphs, and this has been related to ecosystem properties such as stability and robustness. Analysing 46 food webs of various kinds, we find that most food webs fall into one of two distinct motif families. The separation between the families is well predicted by a global measure of hierarchical order in directed networks—trophic coherence. We find that trophic coherence is also a good predictor for the extent of omnivory, defined as the tendency of species to feed on multiple trophic levels. We compare our results to a network assembly model that admits tunable trophic coherence via a single free parameter. The model is able to generate food webs in either of the two families by varying this parameter, and correctly classifies almost all the food webs in our database. This is in contrast with the two most popular food web models, the generalized cascade and niche models, which can only generate food webs within a single motif family. Our findings suggest the importance of trophic coherence in modelling local preying patterns in food webs.

The geometry of evolved community matrix spectra

Article Open access 29 August 2022

A network simplification approach to ease topological studies about the food-web architecture

Article Open access 17 August 2022

Reinterpreting the relationship between number of species and number of links connects community structure and stability

Article 31 May 2021

Introduction

Food webs are abstract representations of which species consume which others in an ecosystem^1,2,3. In a network-based description, species are represented by nodes and their trophic interactions are represented by directed links, pointing from prey to predator^2,4,5. Much work has been devoted to understanding the origin and meaning of the particular trophic interaction patterns observed in these food webs^6,7,8. Faced with the complexity of whole food webs, many researchers have focused on the interactions among subsets of species, through the analysis of small, connected subgraphs, or motifs ^{9,10,11,12,13}.

The study of local interaction patterns via small network subgraphs¹⁴ first emerged in the study of neuronal and metabolic networks^15,16. The methodology of analyzing the relative prevalence of small subgraphs with respect to a well-posed null model for network assembly remains the main way to gain an understanding of the local structural properties of networks, including food webs^11,12.

In this study we focus on the three-node connected triads of which there are 13 distinct ones (Fig. 1). Many of these triads admit a straightforward interpretation in the context of food-webs¹². The eight triads D1–D8 have double links which correspond to mutual predation between two species. The five single link triads S1–S5 consist of some of the more basic building blocks of food webs. The triad S1 is a simple food chain^3,13, S2 represents omnivory (a predator preying on two species at different trophic levels)^13,17, triad S3 is a cycle (a relatively rare feature)^12,17, and triads S4–S5 represent a predator preying on two species (apparent competition) and two predators sharing a prey species (direct competition), respectively¹¹.

There are several competing hypotheses for the relative prevalence of these subgraphs in food webs. The prevailing hypotheses are that subgraphs emerge as a result of physical constraints (e.g species body size, abstracted by the niche dimension) in the assembly of food networks^9,12, that functional importance leads to the observed structural patterns¹⁸, or that certain stability properties favour some subgraphs over others¹¹.

Attempts to explain subgraph patterns using the two most established food web models, the generalized cascade model¹⁹ and the generalized niche model²⁰, have been unsatisfactory since either model produces food webs with rigid three-species subgraph patterns^9,12 while real food-webs display a far richer array of local preying patterns^11,12. To remedy this disagreement between theory and observation, we study a new food-web model, the Generalized Preferential Preying Model (GPPM)²¹ which can accurately predict the three-species subgraph patterns across a wide array of distinct types of food-webs.

The ecological role of the omnivory triad S2 has been under particular scrutiny^9,11,12,22. Different methodologies, however, have resulted in inconsistent claims about the prevalence of the omnivory triad in empirical food webs^9,11 and its effect on food web stability is still unclear²². In this work we show that omnivory is a crucial feature that motivates a new classification of food webs which could provide insight into the controversy regarding the nature and role of omnivory²³.

Local structural patterns in complex networks are intimately related to global network properties^24,25. A network metric called trophic coherence was recently introduced in order to capture the degree to which the nodes fall neatly into distinct levels^21,26,27,28. In the context of food webs, these are the trophic levels, and high coherence corresponds to the species at one level consuming almost exclusively species at the level immediately below (i.e. low omnivory). Trophic coherence was shown to be a major predictor of the linear stability of ecosystem models, as well as of a number of structural properties of empirical food webs²¹. It has also been related to the numbers of cycles in directed networks, and to the distribution of eigenvalues of associated matrices²⁶.

Trophic coherence is a structural property of directed networks that places constraints on local topological features and on the prevalence of small subgraphs in particular. In this paper we present evidence that the relative prevalence of three-species subgraphs in food webs can be explained by the level of trophic coherence in both empirical and model food webs. This result provides another viewpoint in the debate about the origin of subgraph prevalences in food webs and further evidence of the importance of global organization in food webs²¹.

Methods

Quantifying triad significance

For any given network the exact number N _k of any of the k = 1, …, 13 connected three-node subgraphs (triads, Fig. 1) is influenced by the network size and the degree distribution of the vertices. To test the statistical significance of any given triad k, the empirically observed number N _k is compared against appearances of the same triad in a randomized ensemble of networks serving as a null model¹⁶. This comparison gives a statistical significance or z-score

$${z}_{k}=\frac{{N}_{k}-{\langle {N}_{k}\rangle }_{{\rm{rand}}}}{{\sigma }_{{\rm{rand}}}},$$

(1)

where ${\langle {N}_{k}\rangle }_{{\rm{rand}}}$ and ${\sigma }_{{\rm{rand}}}$ are the randomized ensemble average and standard deviation for triad k, respectively. The z-score of triad k thus measures the deviation of prevalence in the observed network with respect to the null model.

The z-scores of all 13 triads can be summarized in a triad significance profile (TSP) which is a vector ${\bf{z}}=\{{z}_{k}\}$ with components z _k for each triad k. Additionally, the normalized version of the TSP is often used to compare networks of different sizes and link densities¹⁶. This is given by

$$\hat{{\bf{z}}}=\{\frac{{z}_{k}}{\sqrt{\sum _{k=1}^{13}{z}_{k}^{2}}}\}.$$

(2)

The randomization procedure used to obtain the randomized ensemble statistics is a matter of choice. A careful selection of null model is important to discern between real effects and artefacts present in the TSP²⁹. In our analysis, we follow the configuration model (CM) prescription^30,31, and preserve the number of incoming and outgoing links for each node (the degree sequence) while randomizing links via a Markov chain Monte Carlo switching algorithm^15,16. This preserves both the total number of nodes (species) and the links (trophic interactions) in the network. The generation of randomized networks and counts of triads was carried out with mfinder, the algorithm used by Milo et al. in their seminal work on network motifs^15,32.

It is important to emphasize that the TSP is a relative measure of which triads are over- and under-represented with respect to the null model provided by the randomized CM networks. The over-(under-)representation as indicated by a positive (negative) z-score indicates that these triads appear more (less) frequently than in the randomized networks but do not imply an absolute saturation (absence) of said triads. Nevertheless, the TSP is an adequate tool for comparing networks of different sizes and degree distributions.

Comparing networks based on triad significance

To quantitatively compare networks based on their TSP, we use Pearson’s correlation coefficient r between the normalized z-score vectors ${\hat{{\bf{z}}}}^{a}$ and ${\hat{{\bf{z}}}}^{b}$ of networks a and b, respectively^12,16. This is defined as

$$r=\frac{\sum _{k=1}^{n}({\hat{z}}_{k}^{a}-{\bar{z}}^{a})({\hat{z}}_{k}^{b}-{\bar{z}}^{b})}{(n-1){\sigma }_{{\hat{{\bf{z}}}}^{a}}{\sigma }_{{\hat{{\bf{z}}}}^{b}}},$$

(3)

where

$${\bar{z}}^{a}=\frac{\sum _{k=1}^{n}{\hat{z}}_{k}^{a}}{n}$$

(4)

and

$${\sigma }_{{\hat{{\bf{z}}}}^{a}}=\sqrt{\frac{1}{n-1}\sum _{k=1}^{n}{({\hat{z}}_{k}^{a}-{\bar{z}}^{a})}^{2}}$$

(5)

are the mean and the standard deviation of the normalized z-score vectors, a and b specify the networks, k is an index over the triads and n = 13 is the total number of triads.

With this definition a value of r close to 1 indicates that the two networks have very similar TSPs and thus patterns of over- and under-represented triads, a value close to 0 indicates no similarity, and a value close to −1 indicates anti-similarity—i.e. triads over-represented in one network will typically be under-represented in the other (and vice versa).

Comparing the empirical networks is straightforward as we just calculate the r-coefficient pairwise for the z-score vectors of all 46 food webs in our database. On the other hand, for comparison with the model (described in a subsequent section), for each empirical network we fit our food-web model to the data, generate 1000 instances of a model network and then compute the r-coefficient of the empirical z-score vector and the average z-score vector of the model-generated ensemble.

Clustering food webs into families

To uncover clusters of food webs with similar TSPs, we use a hierarchical, agglomerative clustering algorithm³³ based on the Pearson’s correlation coefficient r between TSPs. First, we need to convert this to a distance measure. We define

$$d=\sqrt{\mathrm{2(1}-r)}.$$

(6)

This definition ensures that d is a Euclidean metric³⁴ and we can readily apply hierarchical clustering. We use the UPGMA (average linkage) algorithm³³ to uncover the full cluster hierarchy.

Trophic coherence

Trophic coherence is a topological metric for directed networks that characterizes how layered the network is^21,26. It measures the extent to which we can separate nodes into distinct groups so that any given group receives incoming links from just one other group and has outgoing links to another, different group of nodes. In the context of food webs, it measures the overall tendency of species to feed on multiple distinct trophic levels.

For each species j in the network, we define its trophic level s _j as the average trophic level of its prey, plus one^21,35,

$${s}_{j}=1+\frac{1}{{k}_{j}^{{\rm{in}}}}\sum _{i}{a}_{ij}{s}_{i},$$

(7)

where ${k}_{j}^{{\rm{in}}}={\sum }_{i}{a}_{ij}$ is the number of prey of species j (also known as the in-degree) and a _ij are entries of the adjacency matrix A of the food web. Here the convention is that the directed trophic links point from prey i to predator j.

Because of the recursive nature of Eq. (7), to assign a trophic level to every node in the network two conditions must hold. First, there must be at least one node of zero in-degree – we call such nodes basal; and second, every node in the network must be reachable by a path from at least one basal node. Food webs satisfy both conditions so the linear system defined by Eq. (7) has a unique solution. Without loss of generality we assign s _j = 1 for all basal species, as is the convention in ecology.

We define the trophic distance associated to link a _ij in the network as the difference between the trophic levels at the endpoints, ${x}_{ij}={s}_{j}-{s}_{i}$. Note that this is not a distance in the mathematical sense as it can take negative values. Denote by p(x) the distribution of trophic distances as measured on a network. This will have mean 〈x〉 = 1 by definition and a standard deviation $q=\sqrt{\langle {x}^{2}\rangle -1}$ which we will call the trophic incoherence parameter.

The trophic incoherence parameter is thus a measure of the homogeneity of the distribution p(x). For perfectly coherent networks we have q = 0, which translates to having only integer valued trophic levels and all species feeding on prey only one trophic level below their own. In this case the network is perfectly structured, or layered, as there are distinct groups of herbivores feeding only on basal species, predators feeding only on herbivores and so on. For less coherent networks, q > 0 indicates a less ordered trophic structure, where trophic levels take fractional values and species tend to prey on a broader range of trophic levels. See Fig. 2 for examples of coherent and incoherent food webs.

Model with tunable trophic coherence

Various mathematical models of food webs have been proposed to capture and explain different aspects of food webs^{19,20,36,37,38}, but the main models still fail to capture the full variety of empirically observed structures^12,21,39,40. To reproduce many of the empirical structures²¹, in particular the prevalence of three-species motifs, we propose a model for food webs that allows us to adjust the incoherence parameter q by means of fitting a single free parameter. The model is a generalization of the Preferential Preying Model (PPM) introduced in ref.²¹, with the improvement that it can generate bidirectional links and cycles of higher order, thus producing more realistic networks. In the following we denote by B, N and L the number of basal nodes, total nodes and links in the network respectively, all parameters to be fitted using the empirical network data.

We begin with B basal nodes and no links. We assign trophic levels s = 1 to all basal nodes. We then add N − B new nodes to the network sequentially according to the following rule. For each new node j, pick exactly one prey i at random from among all the existing nodes in the network, thus creating a link from i to j. In doing so, we define the temporary trophic level of node j as ${\hat{s}}_{j}=1+{\hat{s}}_{i}$. After this procedure finishes, we have a network of N nodes and N−B links, and each node has a (temporary) trophic level ${\hat{s}}_{i}$.

Once all N nodes are created, we add the remaining links to the network to bring the expected number of links up to L. The links are chosen among all possible pairs of nodes (i, j) where j is not a basal node (this ensures no incoming links to basal nodes which would make them non-basal), with a probability P _ij that decays with the (temporary) trophic distance ${\hat{x}}_{ij}={\hat{s}}_{j}-{\hat{s}}_{i}$ between them. Specifically, we set

$${P}_{ij}\propto \exp (-\frac{{({\hat{x}}_{ij}-1)}^{2}}{2{T}^{2}}),$$

(8)

where T is a free parameter which sets the degree of prey diversity between multiple trophic levels. This form of probability ensures that the most likely links to be created are between adjacent (temporary) trophic levels. The probabilities in Eq. (8) are normalized so that the expected number of links in the final network is L.

At the end of the network creation procedure the trophic levels need to be recalculated according to Eq. (7) as the addition of new links will have changed the network topology, and the trophic levels in the final network need not correspond to the temporary integer valued trophic levels.

The free parameter T is analogous to temperature in statistical physics and sets the amount of deviation from a perfectly coherent network. For T = 0, only links between adjacent (temporary) trophic levels are allowed which results in the incoherence parameter q = 0. In this case the temporary trophic levels coincide with the actual trophic levels as the addition of links does not change the initially assigned trophic levels. As T is increased, links between a wider range of (temporary) trophic levels become more probable, so we expect q > 0 and increasingly more random networks. A sample dependence of q on T is shown in Fig. 3. The model exhibits a monotonic dependence of the incoherence parameter q on temperature T which provides a basis for fitting the model to empirical food webs given the empirically observed q. We also find that the level of incoherence that is achieved at any given temperature depends on B/N, the ratio of basal species to all species. We will further explore this relationship in the subsequent section.

To fit the model to the food web data, we provide as input the number of basal species B, the number of total species N, and the number of links or trophic interactions L. We then use stochastic root finding to find the value of the temperature parameter T that results in an ensemble of networks whose incoherence parameter q is centred about the empirical incoherence parameter as measured from the food web topology.

Empirical food web data

We study the triad significance profile (TSP) in 46 empirical food webs from a variety of environments: marine, freshwater (river and lake) and terrestrial. Table 1 gives the relevant summary statistics of each food web. The full structure of each food web is included in supplementary information.

Table 1 An alphabetical list of the 46 food webs studied in the paper.

Full size table

Results

Motifs in empirical food webs

The main results are summarized in Figs 4 and 5.

Figure 4 shows the pairwise Pearson correlation coefficients of the triad significance profiles between all 46 food webs. The food webs are arranged by increasing incoherence parameter q so that more coherent food webs are assigned a lower ID. Red hue or warmer colours indicate a larger coefficient, while blue hue or colder colours indicate an anti-correlation in the TSPs.

We see that roughly two families of food webs emerge with similar TSPs. The first family (roughly ID 1-22) is characterized by relatively high coherence (low incoherence parameter q), for which the similarities in the TSPs are very high ($r\ge 0.8$).

There is a second family of food webs, characterized by a high incoherence parameter q, that also show high similarities in their TSPs. Membership to this second family is not as clear as there is a tighter core of food webs belonging to it, with a periphery that only shares some similarities.

To make these ideas more precise, we performed hierarchical clustering of food webs based on a distance metric derived from the pairwise Pearson correlation coefficients. The resulting clusters are shown as a dendrogram in Fig. 4. By choosing a threshold distance d _c, we can group food webs into a number of distinct families based on the similarities of their TSPs. Setting ${d}_{c}=1.1$, we identify two families which include all webs. Family 1 consists of food webs with ID 1–22, 25, 28, 29, 31, 36, 41, 44, 46 whereas Family 2 contains webs with ID 23, 24, 26, 27, 30, 32–35, 37–40, 42, 43, 45. We also observe that these larger families contain within themselves smaller, even more closely related clusters (e.g ID 1–22 corresponding to very low q).

Setting a lower threshold distance could provide a more fine-grained classification of food webs in more than two distinct families but we now show that this coarse classification into two families allows us to qualitatively differentiate food webs based on species preying patterns, specifically the extent of omnivory. To this end, we look closer at the bulk behaviour of the TSPs for the two families. Figure 5 shows the normalized profiles of Family 1 (top) and Family 2 (bottom).

We first consider Family 1. The bulk behaviour of food webs in this family is characterized by an over-representation of triads S1, S4 and S5, as well as an under-representation of triad S2 (with the exception of ID 22 Michigan Lake, ID 29 Florida Bay and ID 46 Everglades Graminoid Marshes). We should find the pattern of under-representation of triad S2 (which represents omnivory) unsurprising, since the majority of food webs belonging to this family have a low incoherence parameter q, which limits the ability of species to feed on multiple different trophic levels. Equally, the over-representation of triads S1, S4 and S5 is to be expected as these are the only three triads out of 13 that can arise in a hypothetical food web with q = 0, which is a value close to the empirical values of q for food webs in this family. The double link triads D1-D8 are all under-represented or close to even, in agreement with our expectations.

We now turn to Family 2. Here the triads S1, S4 and S5 no longer follow a strong pattern of over-representation and the double link triads D1-D8 are not always under-represented. The most distinguishing feature, however, is the bulk over-representation of triad S2 (with the exception of ID 40 Weddell Sea), in stark contrast to Family 1. We will argue that this is the main feature that separates the two food web families.

This pattern of food webs based on the under- or over-representation of triad S2 was alluded to in previous work¹², however it is in disagreement with the predictions of the generalized cascade¹⁹ and niche²⁰ models which can only produce food webs where S2 is over-represented¹². Subsequently, we present results from our model which show that it is possible to change the pattern of under-representation to over-representation of triad S2 by increasing the incoherence parameter q, thus providing evidence that trophic coherence can naturally give rise to two food web families characterized by low or high prevalence of omnivory, respectively.

Comparison between empirical and model networks

We have also investigated the similarities of triad significance profiles between the empirical food webs and model generated food webs. To this end we study the similarity of the TSPs between each empirical food web and an ensemble of model food webs fitted to the data of the empirical one. The results are summarized in Fig. 6. Averaging over an ensemble of 1000 model generated food webs fitted to each empirical food web, we measured the Pearson correlation coefficient between the TSP of the empirical food web and the TSP of the ensemble average. The results show that the model is able to reproduce empirically observed TSPs for the majority of food webs in both families with high accuracy. The model fails to produce accurate TSPs for a number of food webs and sometimes even produces anti-correlated TSPs (r < 0). If we require that r > 0.5, eight food webs are not able to be reproduced accurately by our model, five in Family 1 (ID 31 Lough Hyne, ID 36 Carpinteria Salt Marsh Reserve, ID 41 Caribbean Reef, ID 44 El Verde Rainforest and ID 46 Everglades Graminoid Marshes) and three in Family 2 (ID 23 Bridge Broom Lake, ID 24 Grassland (U.K.) and ID 40 Weddell Sea). Recall that IDs are assigned in the order of increasing q so these particular food webs are unusual members of their respective families in that they tend to have extreme values of q with respect to the majority of networks in either family (higher than average in Family 1 and lower than average in Family 2). Because of the imperfect agreement between q and family membership, our model cannot replicate the structure of these sporadic webs. This suggests that for some food webs information about trophic coherence q may not be enough to reproduce realistic looking TSPs and there may be further mechanisms of prey selection at play¹².

The role of omnivory and basal species

We now focus on the claim that the main difference between the two families of food webs is the relative under- and over-representation of triad S2, or the degree of omnivory in a food web. A prevalence of triad S2 indicates that the species in a food web often feed on different trophic levels, contributing to an increased incoherence parameter q as discussed at the start of this section. A scarcity of triad S2, on the other hand, indicates that species only tend to feed on prey with similar trophic levels, which in turn signals a low incoherence parameter. This suggests a relationship between the z-score of triad S2 and network incoherence as measured by q.

Furthermore, model results (Fig. 3) suggest that a high proportion of basal species to all species, B/N, produces more coherent food webs (i.e. with a low incoherence parameter q). We take this as an additional predictive food web statistic for family membership.

Our findings are summarized in Fig. 7. This is a scatter plot of all 46 food webs where we have plotted the fitted model temperature T and the measured incoherence parameter q against the ratio of basal species to all species B/N. We observe a clear anti-correlation between q and B/N (linear model $q=a\tfrac{B}{N}+b$: $a=-1.06$, $b=0.77$, ${R}^{2}=0.53$, $p=8.47\cdot {10}^{-9}$) that indicates a positive relationship between how coherent a network is (low q) and how many of its species are basal.

We have also coloured the markers of each food web to indicate the level of over- or under-representation of triad S2 as measured by the normalized z-score ${\hat{z}}_{{\rm{S2}}}$. Red circles indicate an over-representation while blue diamonds indicate an under-representation of S2 in the respective food web. Remarkably, based on this measure, we uncover two clusters of food webs corresponding roughly to the two families based on TSP similarities. The first cluster is once again characterized by a high incoherence parameter q as well as a low ratio of basal species to all species B/N. The second cluster is characterized by a low incoherence parameter and a high ratio of basal species to all species. The only exceptions are six food webs in the first family (ID 20 Coweeta (1), ID 21 Martins Stream, ID 31 Lough Hyne, ID 36 Carpinteria Salt Marsh Reserve, ID 41 Caribbean Reef and ID 44 El Verde Rainforest), four of which correspond to food webs poorly matched by our model (Fig. 6). We conclude that, indeed, the main difference between the two families is the relative role of triad S2 as already observed in the bulk behaviour of the TSPs in Fig. 5.

Finally, we study whether our model exhibits a similar transition from a relatively S2-poor to an S2-rich state which would explain the relatively good agreement between empirical and model generated TSPs for the two families (Fig. 6). We find that for a given basal species ratio B/N there exists a critical temperature ${T}_{c}$, and thus a critical incoherence parameter q _c, which signifies such a transition. For T (and q) below these critical values, the model generates networks where S2 is under-represented, while for values above critical, the networks generated have either an even or an over-represented number of S2 triads. We include the transition line of the two regimes in Fig. 7 for an ensemble of 100 model networks with N = 100 species and an average (non-basal) degree $\langle k\rangle =L/(N-B)=10$. Networks with q below the line show an under-representation of S2 triads, while networks with q above the line show an over-representation as measured by ${\hat{z}}_{{\rm{S2}}}$.

Remarkably, the model results are in very good agreement with the empirical data despite the fact that both the network size N and the average degree k vary considerably between the empirical food webs. Almost all food webs with an under-represented number of S2 triads fall below the transition line of the model while those with an over-represented number reside above the line.

These findings suggest that the two families of food webs differ in the degree of omnivory present as measured by the prevalence of triad S2 which is itself intimately related to the incoherence parameter q. Interestingly, based on the strong anti-correlation between q and B/N, either parameter is a strong determinant of family membership. To our knowledge, the GPPM is the first food-web model able to reproduce triad significance profiles consistent with empirical observations. The ability to produce model networks belonging to either of the two families suggests that the parameters q and B/N are both important in the mathematical modelling of food webs and may, in fact, be fundamental for understanding local preying patterns in food webs.

Discussion

Our investigation of trophic interaction patterns in food webs has revealed significant correlations between the degree of omnivory, hierarchical organization of trophic species and the density of basal species.

The analysis of local trophic interactions via triad significance profiles in empirical food webs reveals two distinct families of food webs characterized by a relatively low or high incoherence parameter respectively. While certain differences across families of food webs based on their TSPs have been observed before¹², these are not predicted by any existing food web models, calling into question their use as null models given the academic significance attached to food web motifs^{9,10,11,12,13}. Trophic coherence provides a network theoretic metric that enables us to classify and predict the relative prevalence of such motifs.

We have shown qualitatively that the the main difference between the two food web families is the extent of omnivory, as measured by the over- or under-representation of triad S2 (the feed-forward loop). This classification of food webs into two families according to the extent of omnivory is at odds with previous claims that omnivory occurs more often than one would expect to happen by chance across most food webs¹². On the other hand, the existence of these families may be related to different ways omnivory emerges in food webs and influences their stability^21,22,23. We have tested our prediction for the onset of omnivory using a new model for generating synthetic food webs with a given trophic coherence. We find that the model exhibits a transition from an under-representation of omnivory to an over-representation of omnivory as a function of trophic coherence. Our model results fit the food web data very well, providing evidence of the importance of trophic coherence as well as the basal species density in modelling realistic trophic interactions. We would like to emphasize that these findings are remarkably robust between food webs originating from vastly different habitats.

This work has expanded on the importance of trophic coherence in predicting structural features in food webs²¹, but the biological origin of trophic coherence remains elusive. Basal species density and its effect of suppressing highly incoherent structures in both empirical and model food webs may provide some clues. All other things being equal, a higher proportion of autotrophs in a food web will necessarily mean that a higher proportion of consumers will feed on these basal species. In turn, this would have a dampening effect on the formation of long food chains in the trophic hierarchy and hence fewer possibilities for a varied diet of species at the top. Figure 2 exemplifies how this hypothesis could lead to very different food web structures. Established food web models do not treat basal species density as a predictor for emergent structure but rather as an emergent property itself. On the other hand, most food webs have been found to be significantly more trophically coherent than a random graph with the same density of basal species, so there must be other coherence-inducing mechanisms at play²⁶. Further work is needed to elucidate the reasons behind this property of ecosystem structure.

References

Paine, R. T. Food web complexity and species diversity. The Am. Nat. 100, 65–75, https://doi.org/10.1086/282400 (1966).
Article Google Scholar
Pimm, S. L. Food Webs (Springer Netherlands, 1982).
Cohen, J., Briand, F. & Newman, C. Community food webs: data and theory, vol. 20 of Biomathematics (Springer-Verlag, Berlin, Germany, 1990).
Dunne, J. A., Williams, R. J. & Martinez, N. D. Network structure and robustness of marine food webs. Mar. Ecol. Prog. Ser. 273, 291–302, https://doi.org/10.3354/meps273291 (2004).
Article ADS Google Scholar
Drossel, B. & McKane, A. J. Modelling food webs. In Handbook of Graphs and Networks, 218–247 (Wiley-VCH Verlag GmbH & Co. KGaA) https://doi.org/10.1002/3527602755.ch10 (2005).
May, R. M. Stability and complexity in model ecosystems, vol. 6 (Princeton University Press, 1973).
Pimm, S. L., Lawton, J. H. & Cohen, J. E. Food web patterns and their consequences. Nat. 350, 669–674, https://doi.org/10.1038/350669a0 (1991).
Article ADS Google Scholar
Garlaschelli, D., Caldarelli, G. & Pietronero, L. Universal scaling relations in food webs. Nat. 423, 165–168, https://doi.org/10.1038/nature01604 (2003).
Article ADS CAS MATH Google Scholar
Camacho, J., Stouffer, D. & Amaral, L. Quantitative analysis of the local structure of food webs. J. theoretical biology 246, 260–268, https://doi.org/10.1016/j.jtbi.2006.12.036 (2007).
Article MathSciNet CAS Google Scholar
Paulau, P. V., Feenders, C. & Blasius, B. Motif analysis in directed ordered networks and applications to food webs. Sci. Reports 5, 11926, https://doi.org/10.1038/srep11926 (2015).
Article ADS Google Scholar
Borrelli, J. J. Selection against instability: stable subgraphs are most frequent in empirical food webs. Oikos 124, 1583–1588, https://doi.org/10.1111/oik.02176 (2015).
Article Google Scholar
Stouffer, D. B., Camacho, J., Jiang, W. & Amaral, L. A. N. Evidence for the existence of a robust pattern of prey selection in food webs. Proc. Royal Soc. Lond. B: Biol. Sci. 274, 1931–1940, https://doi.org/10.1098/rspb.2007.0571 (2007).
Article Google Scholar
Bascompte, J. & Melián, C. J. Simple trophic modules for complex food webs. Ecol. 86, 2868–2873, https://doi.org/10.1890/05-0101 (2005).
Article Google Scholar
Itzkovitz, S., Milo, R., Kashtan, N., Ziv, G. & Alon, U. Subgraphs in random networks. Phys. Rev. E 68, 026127, https://doi.org/10.1103/PhysRevE.68.026127 (2003).
Article ADS MathSciNet CAS Google Scholar
Milo, R. et al. Network motifs: simple building blocks of complex networks. Sci. (New York, N.Y.) 298, 824–7, https://doi.org/10.1126/science.298.5594.824 (2002).
Article ADS CAS Google Scholar
Milo, R. et al. Superfamilies of evolved and designed networks. Sci. (New York, N.Y.) 303, 1538–42, https://doi.org/10.1126/science.1089167 (2004).
Article ADS CAS Google Scholar
Polis, G. A. Complex trophic interactions in deserts: An empirical critique of food-web theory. The Am. Nat. 138, 123–155, https://doi.org/10.1086/285208 (1991).
Article Google Scholar
Prill, R. J., Iglesias, P. A. & Levchenko, A. Dynamic properties of network motifs contribute to biological network organization. PLoS Biol 3; https://doi.org/10.1371/journal.pbio.0030343 (2005).
Stouffer, D. B., Camacho, J. & Amaral, L. A. N. A robust measure of food web intervality. Proc.Natl. Acad. Sci. 103, 19015–19020, https://doi.org/10.1073/pnas.0603844103 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Williams, R. J. & Martinez, N. D. Simple rules yield complex food webs. Nat. 404, 180–183, https://doi.org/10.1038/35004572 (2000).
Article ADS CAS Google Scholar
Johnson, S., Domínguez-García, V., Donetti, L. & Muñoz, M. A. Trophic coherence determines food-web stability. Proc.Natl. Acad. Sci. 111, 17923–17928, https://doi.org/10.1073/pnas.1409077111 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Monteiro, A. B. & Faria, L. D. B. The interplay between population stability and food-web topology predicts the occurrence of motifs in complex food-webs. J. Theor. Biol. 409, 165–171, https://doi.org/10.1016/j.jtbi.2016.09.006 (2016).
Article PubMed Google Scholar
Allesina, S., Bodini, A. & Pascual, M. Functional links and robustness in food webs. Philos. Transactions Royal Society B: Biol. Sci. 364, 1701–1709, https://doi.org/10.1098/rstb.2008.0214 (2009).
Article Google Scholar
Vízquez, A. et al. The topological relationship between the large-scale attributes and local interaction patterns of complex networks. Proc.Natl. Acad. Sci. 101, 17940–17945, https://doi.org/10.1073/pnas.0406024101 (2004).
Article ADS Google Scholar
Domínguez-García, V., Pigolotti, S. & Muñoz, M. A. Inherent directionality explains the lack of feedback loops in empirical networks. Sci. Reports 4; https://doi.org/10.1038/srep07497 (2014).
Johnson, S. & Jones, N. S. Looplessness in networks is linked to trophic coherence. Proc.Natl. Acad. Sci. https://doi.org/10.1073/pnas.1613786114 (2017).
Google Scholar
Domínguez-García, V., Johnson, S. & Muñoz, M. A. Intervality and coherence in complex networks. Chaos 26; https://doi.org/10.1063/1.4953163 (2016).
Klaise, J. & Johnson, S. From neurons to epidemics: How trophic coherence affects spreading processes. Chaos 26; https://doi.org/10.1063/1.4953160 (2016).
Beber, M. E. et al. Artefacts in statistical analyses of network motifs: general framework and application to metabolic networks. J. The Royal Soc. 9, 3426–3435, https://doi.org/10.1098/rsif.2012.0490 (2012).
Google Scholar
Newman, M. E. J., Strogatz, S. H. & Watts, D. J. Random graphs with arbitrary degree distributions and their applications. Phys. Rev. E 64, 026118, https://doi.org/10.1103/PhysRevE.64.026118 (2001).
Article ADS CAS Google Scholar
Newman, M. Networks: An Introduction (Oxford University Press, Inc., New York, NY, USA, 2010).
Kashtan, N., Itzkovitz, S., Milo, R. & Alon, U. Mfinder tool guide. Department of Molecular Cell Biology and Computer Science and Applied Math., Weizmann Inst. of Science, Rehovot Israel, technical report (2002).
Everitt, B. S., Landau, S., Leese, M. & Stahl, D. Hierarchical Clustering, 71–110 (John Wiley & Sons, Ltd, 2011).
van Dongen, S. & Enright, A. J. Metric distances derived from cosine similarity and Pearson and Spearman correlations. ArXiv e-prints https://arxiv.org/abs/1208.3145 (2012).
Levine, S. Several measures of trophic structure applicable to complex food webs. J. Theor. Biol. 83, 195–207, https://doi.org/10.1016/0022-5193(80)90288-X (1980).
Article Google Scholar
Cohen, J. E. & Newman, C. M. A stochastic theory of community food webs: I. models and aggregated data. Proc. Royal Soc. Lond. B: Biol. Sci. 224, 421–448, https://doi.org/10.1098/rspb.1985.0042 (1985).
Article ADS Google Scholar
Cattin, M.-F., Bersier, L.-F., Banašek-Richter, C., Baltensperger, R. & Gabriel, J.-P. Phylogenetic constraints and adaptation explain food-web structure. Nat. 427, 835–839, https://doi.org/10.1038/nature02327 (2004).
Article ADS CAS Google Scholar
Stouffer, D. B., Camacho, J., Guimerà, R., Ng, C. A. & Nunes Amaral, L. A. Quantitative patterns in the structure of model and empirical food webs. Ecol. 86, 1301–1311, https://doi.org/10.1890/04-0957 (2005).
Article Google Scholar
Allesina, S., Alonso, D. & Pascual, M. A general model for food web structure. Sci. 320, 658–661, https://doi.org/10.1126/science.1156269 (2008).
Article ADS CAS Google Scholar
Williams, R. J. & Martinez, N. D. Success and its limits among structural models of complex food webs. J. Animal Ecol. 77, 512–519, https://doi.org/10.1111/j.1365-2656.2008.01362.x (2008).
Article Google Scholar
Thompson, R. M. & Townsend, C. R. Impacts on stream food webs of native and exotic forest: An intercontinental comparison. Ecol. 84, 145–161, https://doi.org/10.1890/0012-9658(2003)084[0145:iosfwo]2.0.co;2 (2003).
Article Google Scholar
Thompson, R. M. & Townsend, C. R. Energy availability, spatial heterogeneity and ecosystem size predict food-web structure in stream. Oikos 108, 137–148, https://doi.org/10.1111/j.0030-1299.2005.11600.x (2005).
Article Google Scholar
Townsend et al. Disturbance, resource supply, and food-web architecture in streams. Ecol. Letters 1, 200–209, https://doi.org/10.1046/j.1461-0248.1998.00039.x (1998).
Article Google Scholar
Yodzis, P. Local trophodynamics and the interaction of marine mammals and fisheries in the benguela ecosystem. J. Animal Ecol. 67, 635–658, https://doi.org/10.1046/j.1365-2656.1998.00224.x (1998).
Article Google Scholar
Havens, K. Scale and structure in natural food webs. Sci. 257, 1107–1109, https://doi.org/10.1126/science.257.5073.1107 (1992).
Article ADS CAS Google Scholar
Bascompte, J., Melián, C. & Sala, E. Interaction strength combinations and the overfishing of a marine food web. Proceedings of the National Academy of Sciences of the United States of America 102, 5443–5447, https://doi.org/10.1073/pnas.0501562102 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Opitz, S. Trophic interactions in Caribbean coral reefs. ICLARM Tech. Rep. 43, 341 (1996).
Google Scholar
Lafferty, K. D., Hechinger, R. F., Shaw, J. C., Whitney, K. L. & Kuris, A. M. Food webs and parasites in a salt marsh ecosystem. In Collinge, S. K. & Ray, C. (eds) Disease ecology: Community structure and pathogen dynamics, 119–134 (2006); DOI: https://doi.org/10.1093/acprof:oso/9780198567080.003.0009.
Ulanowicz, R. E. & Baird, D. Nutrient controls on ecosystem dynamics: the Chesapeake mesohaline community. Journal of Marine Systems 19, 159–172, https://doi.org/10.1016/S0924-7963(98)90017-3 (1999).
Article ADS Google Scholar
Abarca-Arenas, L. G. & Ulanowicz, R. E. The effects of taxonomic aggregation on network analysis. Ecological Modelling 149, 285–296, https://doi.org/10.1016/S0304-3800(01)00474-4 (2002).
Article Google Scholar
Polis, G. Complex trophic interactions in deserts: an empirical critique of food-web theory. Am. Nat. 138, 123–125, https://doi.org/10.1086/285208 (1991).
Article Google Scholar
Ulanowicz, R. Growth and development: Ecosystems phenomenology. springer, new york. pp 69–79. Network Analysis of Trophic Dynamics in South Florida Ecosystem, FY 97: The Florida Bay Ecosystem.; https://doi.org/10.1007/978-1-4612-4916-0 (1986).
Ulanowicz, R., Bondavalli, C. & Egnotovich., M. Spatial and temporal variation in the structure of a freshwater food web. Network Analysis of Trophic Dynamics in South Florida Ecosystem, FY 97: The Florida Bay Ecosystem. (1998).
Waide, R. B. & Reagan, D. P. The Food Web of a Tropical Rainforest (University of Chicago Press, Chicago, 1996).
Ulanowicz, R., Heymans, J. & Egnotovich, M. Network analysis of trophic dynamics in south florida ecosystems. Network Analysis of Trophic Dynamics in South Florida Ecosystems FY 99: The Graminoid Ecosystem. (2000).
Martinez, N. D., Hawkins, B. A., Dawah, H. A. & Feifarek, B. P. Effects of sampling effort on characterization of food-web structure. Ecol. 80, 1044–1055, https://doi.org/10.1890/0012-9658(1999)080[1044:eoseoc]2.0.co;2 (1999).
Article Google Scholar
Martinez, N. D. Artifacts or attributes? Effects of resolution on the Little Rock Lake food web. Ecol. Monogr. 61, 367–392, https://doi.org/10.2307/2937047 (1991).
Article Google Scholar
Riede, J. et al. Stepping in Elton’s footprints: a general scaling model for body masses and trophic levels across ecosystems. Ecol. Letters 14, 169–178, https://doi.org/10.1111/j.1461-0248.2010.01568.x (2011).
Article Google Scholar
Eklöf, A. et al. The dimensionality of ecological networks. Ecol. Letters 16, 577–583, https://doi.org/10.1111/ele.12081 (2013).
Article Google Scholar
Almunia, J., Basterretxea, G., Aristeguia, J. & Ulanowicz, R. Benthic-pelagic switching in a coastal subtropical lagoon. Estuarine, Coast. Shelf Sci. 49, 363–384, https://doi.org/10.1006/ecss.1999.0503 (1999).
Article ADS Google Scholar
Mason, D. Quantifying the impact of exotic invertebrate invaders on food web structure and function in the great lakes: A network analysis approach. Interim Prog. Rep. to Gt. Lakes Fish. Comm. yr 1 (2003).
Monaco, M. E. & Ulanowicz, R. E. Comparative ecosystem trophic structure of three u.s mid-atlantic estuaries. Marine Ecol. Progress Series 161, 239–254, https://doi.org/10.3354/meps161239 (1997).
Article ADS Google Scholar
Link, J. Does food web theory work for marine ecosystems? Mar. Ecol. Prog. Ser. 230, 1–9, https://doi.org/10.3354/meps230001 (2002).
Article ADS Google Scholar
Memmott, J., Martinez, N. D. & Cohen, J. E. Predators, parasitoids and pathogens: species richness, trophic generality and body sizes in a natural food web. J. Anim. Ecol. 69, 1–15, https://doi.org/10.1046/j.1365-2656.2000.00367.x (2000).
Article Google Scholar
Warren, P. H. Spatial and temporal variation in the structure of a freshwater food web. Oikos 55, 299–311, https://doi.org/10.2307/3565588 (1989).
Article Google Scholar
Christian, R. R. & Luczkovich, J. J. Organizing and understanding a winter’s Seagrass foodweb network through effective trophic levels. Ecol. Model. 117, 99–124, https://doi.org/10.1016/S0304-3800(99)00022-8 (1999).
Article Google Scholar
Goldwasser, L. & Roughgarden, J. A. Construction and analysis of a large Caribbean food web. Ecol. 74, 1216–1233, https://doi.org/10.2307/1940492 (1993).
Article Google Scholar
Jacob, U. et al. The role of body size in complex food webs. Adv.Eco. Res. 45, 181–223, https://doi.org/10.1016/b978-0-12-386475-8.00005-8 (2011).
Article Google Scholar
Huxham, M., Beaney, S. & Raffaelli, D. Do parasites reduce the chances of triangulation in a real food web? Oikos 76, 284–300, https://doi.org/10.2307/3546201 (1996).
Article Google Scholar

Download references

Acknowledgements

J.K. was supported by the EPSRC under grant EP/IO1358X/1. S.J. is grateful for support from Spanish MINECO Grant No. FIS2013-43201-P (FEDER funds).

Author information

Authors and Affiliations

Centre for Complexity Science, University of Warwick, Coventry, CV4 7AL, United Kingdom
Janis Klaise & Samuel Johnson
School of Mathematics, University of Birmingham, Birmingham, B15 2TT, United Kingdom
Samuel Johnson

Authors

Janis Klaise
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Johnson
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.K. and S.J. designed the research and wrote the manuscript. J.K. analysed the data, performed the experiments and produced figures.

Corresponding author

Correspondence to Janis Klaise.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Supplementary Dataset 1

Supplementary Dataset 2

Supplementary Dataset 3

Supplementary Dataset 4

Supplementary Dataset 5

Supplementary Dataset 6

Supplementary Dataset 7

Supplementary Dataset 8

Supplementary Dataset 9

Supplementary Dataset 10

Supplementary Dataset 11

Supplementary Dataset 12

Supplementary Dataset 13

Supplementary Dataset 14

Supplementary Dataset 15

Supplementary Dataset 16

Supplementary Dataset 17

Supplementary Dataset 18

Supplementary Dataset 19

Supplementary Dataset 20

Supplementary Dataset 21

Supplementary Dataset 22

Supplementary Dataset 23

Supplementary Dataset 24

Supplementary Dataset 25

Supplementary Dataset 26

Supplementary Dataset 27

Supplementary Dataset 28

Supplementary Dataset 29

Supplementary Dataset 30

Supplementary Dataset 31

Supplementary Dataset 32

Supplementary Dataset 33

Supplementary Dataset 34

Supplementary Dataset 35

Supplementary Dataset 36

Supplementary Dataset 37

Supplementary Dataset 38

Supplementary Dataset 39

Supplementary Dataset 40

Supplementary Dataset 41

Supplementary Dataset 42

Supplementary Dataset 43

Supplementary Dataset 44

Supplementary Dataset 45

Supplementary Dataset 46

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Klaise, J., Johnson, S. The origin of motif families in food webs. Sci Rep 7, 16197 (2017). https://doi.org/10.1038/s41598-017-15496-1

Download citation

Received: 23 May 2017
Accepted: 27 July 2017
Published: 23 November 2017
DOI: https://doi.org/10.1038/s41598-017-15496-1

This article is cited by

Individual diet variability shapes the architecture of Antarctic benthic food webs
- Simona Sporta Caputi
- Jerzy Piotr Kabala
- Maria Letizia Costantini
Scientific Reports (2024)
A network simplification approach to ease topological studies about the food-web architecture
- Andrea Gini
- Simona Re
- Angelo Facchini
Scientific Reports (2022)
Organisational Social Influence on Directed Hierarchical Graphs, from Tyranny to Anarchy
- Charlie Pilgrim
- Weisi Guo
- Samuel Johnson
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.