Mechanistic neutral models show that sampling biases drive the apparent explosion of early tetrapod diversity

Dunne, Emma M.; Thompson, Samuel E. D.; Butler, Richard J.; Rosindell, James; Close, Roger A.

doi:10.1038/s41559-023-02128-3

Article
Open access
Published: 27 July 2023

Nature Ecology & Evolution volume 7, pages 1480–1489 (2023)

Abstract

Estimates of deep-time biodiversity typically rely on statistical methods to mitigate the impacts of sampling biases in the fossil record. However, these methods are limited by the spatial and temporal scale of the underlying data. Here we use a spatially explicit mechanistic model, based on neutral theory, to test hypotheses of early tetrapod diversity change during the late Carboniferous and early Permian, critical intervals for the diversification of vertebrate life on land. Our simulations suggest that apparent increases in early tetrapod diversity were not driven by local endemism following the ‘Carboniferous rainforest collapse’. Instead, changes in face-value diversity can be explained by variation in sampling intensity through time. Our results further demonstrate the importance of accounting for sampling biases in analyses of the fossil record and highlight the vast potential of mechanistic models, including neutral models, for testing hypotheses in palaeobiology.

Main

The establishment of terrestrial ecosystems and diversification of early tetrapods during the late Carboniferous and early Permian (323–272 million years ago (Ma)) was a key event in vertebrate evolution. This interval was punctuated by a climate change-driven floral turnover at the end of the Carboniferous, referred to as the ‘Carboniferous rainforest collapse’ (CRC)^1,2. In the past decade, several studies have attempted to estimate the impact of the CRC on the diversity of early tetrapods (early representatives of amphibians and amniotes). The first investigation into the impact of the CRC on early tetrapod diversification, by Sahney et al.³, hypothesized that habitat fragmentation caused by the CRC drove increased endemism in early tetrapod communities via the ‘island-biogeography effect’, causing allopatric speciation in newly isolated patches of forest⁴. Such increases in local endemism would, in turn, be expected to lead to a rise in beta diversity and global species richness, coupled with a decline in local richness (alpha diversity)³. This interpretation has been challenged, however, because it took the fossil record at face value and thus did not compensate for pervasive biases caused by various interconnected geological, taphonomic, anthropogenic and historical factors, which result in an uneven spatial and temporal distribution of fossil occurrences^3,4,5,6,7.

More recent investigations have pointed to sampling biases as a possible alternative explanation, and find no evidence of increases in endemism during or after the CRC^5,6. In particular, Dunne et al.⁶, after correcting for sampling, found evidence of increased connectedness between early tetrapod communities (for both amphibians and amniotes) and lower ‘global’ diversity following the CRC—the opposite of that reported by Sahney et al.³. The same study⁶ also suggested that fragmentation of the rainforest probably promoted the recovery and subsequent diversification of amniotes, a clade that today comprises reptiles, birds and mammals⁶. Despite these advances, the early tetrapod fossil record remains fragmentary as well as unevenly and incompletely sampled (particularly in the late Carboniferous)⁷, which obscures patterns of diversity and biogeography during a critical time in vertebrate evolution⁶. The true joint effects of sampling and environmental change on the diversification of tetrapods following the CRC are yet to be unravelled.

Quantitative approaches to correcting for the effects of sampling biases on estimates of past biodiversity typically rely on statistical or phylogenetic techniques^8,9. These approaches have led to substantial revisions of diversity patterns in many fossil groups, including early tetrapods^10,11,12,13,14,15.

Mechanistic neutral models provide an alternative and complementary approach that has not yet been used widely in palaeobiological studies (but see ref. ¹⁶). Neutral models assume individual dynamics are independent of species identity. Making this strong assumption puts the focus on sampling, habitat structure and dispersal in isolation from other potential complicating factors. It also permits use of particularly efficient simulation algorithms¹⁷, enabling us to study spatially explicit samples of individuals from a very large spatially explicit landscape that would be impractical to simulate mechanistically under alternative models. Crucially, such neutral simulations can specify landscapes with realistic size and structure, and enable features such as palaeogeography, habitat loss and habitat fragmentation to be manipulated experimentally¹⁸. Diversity can then be sampled from the simulations in the same locations and with the same intensity as the empirical data, thus providing a new way to test how real-world patterns of fossil record sampling impact inferred patterns of face-value (directly observed, ‘raw’ or uncorrected) diversity, under a range of hypothetical palaeogeographic or ecological scenarios. The mechanistic nature of neutral models also enables us to run the models with samples much larger than the empirical sample sizes. This allows us to predict how observed diversity patterns might change if the intensity of fossil sampling were increased by an order of magnitude, and to understand what patterns could be detected within the currently available fossil data¹⁹. Studies involving neutral simulations can even be used to test theories of diversity dynamics at global scales, much larger than could be directly quantified in the fossil record because of incomplete spatial sampling^16,20,21.

In this study, we apply a spatially explicit version of neutral theory to test the hypothesis that the CRC impacted early tetrapod diversity through habitat fragmentation. Our neutral simulations mimic the empirical structure of the fossil record by sampling at the same locations and to approximately the same intensity as recorded in the empirical or ‘known’ early tetrapod data. We investigate three scenarios related to the CRC. The first (scenario A; Fig. 1a) performs simulations on a ‘pristine’ landscape with no habitat fragmentation (that is, the CRC was absent from this scenario). The second (scenario B; Fig. 1a) models the effect of the CRC as random habitat loss across the landscape. The final scenario (C; Fig. 1a) models the effect of the CRC as a loss in habitat in which ‘habitat islands’ remained around the localities where early tetrapods occurred. We use these spatially explicit neutral simulations to examine the extent to which the empirical fossil record, given its sampling bias, can infer global patterns of diversity change. We estimate trends in tetrapod diversity over time, under a neutral scenario, by simulating our best-fitting models again with constant temporal sampling. This study is, to the best of our knowledge, the first to apply a fully spatially explicit neutral model to empirical fossil data.

**Fig. 1: Schematic outlining the study methodology.**

Results

Neutral models incorporating temporal and spatial biases

In the simplest scenario, we performed simulations on a pristine global landscape with no habitat fragmentation (scenario A; Fig. 1). We sampled diversity patterns from the simulations at the same palaeo-locations and to approximately the same intensity as the real fossil record. Results from the simulations with optimal fixed parameters matched the empirical fossil record well overall, with 80–85% mean accuracy across four diversity metrics: alpha diversity across all localities, mean alpha diversity, beta diversity and gamma diversity (Methods). The simulations could not, however, reproduce alpha, beta and gamma diversity well with the same fixed set of parameters. When compared with the empirical fossil record, the neutral models of early Permian communities with optimized fixed parameters predicted more species (higher gamma diversity), and higher alpha diversity than seen empirically (Fig. 2). Despite these differences, the majority of the empirical values are within the range of variation between simulations, with a mean accuracy across all metrics of 81.3% for amniotes and 82.1% for amphibians.

**Fig. 2: ‘Pristine’ landscape (scenario A).**

Neutral models under habitat fragmentation

To test the hypothesis that fragmentation of the rainforest at the end of the Carboniferous promoted the development of endemism among early tetrapod communities, we modelled two scenarios of habitat loss and fragmentation occurring from 307 Ma onwards: first, a random pattern of habitat loss (scenario B; Fig. 1) and second, a clustered pattern of habitat loss (scenario C; Fig. 1). The random habitat loss scenario (B) maintains connectivity across the landscape as habitat is lost. The clustered habitat scenario (C) leaves isolated habitat ‘islands’ that may promote endemism over geological timescales. These habitat ‘islands’ are conceptually analogous to the oceanic islands in MacArthur and Wilson’s theory of island biogeography⁴. Endemic species may thus arise naturally on such islands within neutral simulations. Scenario C directly tests the mechanistic assumption of Sahney et al.³ (that endemism, driven by fragmentation and manifesting as increasing beta diversity, is the cause of tetrapod diversity increases post-CRC).

Our models of random habitat loss (scenario B) demonstrate that increasing the amount of habitat loss, while keeping all other parameters the same, causes ‘global’ species richness and beta diversity to decline (Fig. 3 and Extended Data Fig. 1). Species richness decreases relatively linearly across all time periods. However, alpha and beta diversity demonstrate a more variable pattern across time for different levels of habitat loss. In particular, the interval between 307 and 297 Ma has very similar alpha diversity for all levels of habitat loss. This is potentially caused by the lower numbers of sampled fossils found at this time (Extended Data Fig. 2), because under poor sampling, inferred alpha diversity will be impacted primarily by the number of sampled individuals, rather than by other factors such as the quantity of surrounding habitat.

**Fig. 3: Random habitat loss (scenario B).**

Our clustered habitat scenario (scenario C) tested whether neutral theory supports the hypothesis that habitat loss results in highly disconnected habitat islands that promote endemism. Under these circumstances, unless the fossil localities were close, dispersal between distinct fossil localities was restricted almost entirely, meaning that the number of shared species between localities was likely to be very low. The neutral simulations of the clustered habitat scenario generated diversity patterns that did not closely fit the empirical fossil data (Fig. 4). Although the overall trend matches to some extent, the simulations had a high level of variability between intervals, primarily dictated by the number of fossil localities known for each interval. Furthermore, loss of habitat, and the resulting decrease in the size of the metacommunity supplying individuals to the fossil sites, caused a reduction in species richness. Similarly, there was also a reduction in alpha diversity, particularly for amniotes.

**Fig. 4: Clustered habitat scenario (scenario C).**

By simulating this same best-fitting scenario (20% random habitat loss for amniotes and a pristine landscape for amphibians), but sampling more individuals at each locality, it is possible to predict the broader diversity changes under the same model beyond the empirical sample size. When ten times more individuals are sampled from each fossil locality, differences emerge compared with simulations in which sampling of the fossil record is exactly matched (Fig. 5). The general trends in species richness over time for both amniotes and amphibians are roughly similar to the trends observed in the fossil record (Fig. 5). However, there is no longer a significant increase in beta diversity post-CRC, especially for amphibians. Likewise, alpha diversity is relatively consistent over time. There is also a broader range in the simulation outcomes where many more individuals are sampled.

**Fig. 5: ‘Upscaled diversity’ from the fossil record using neutral models.**

To remove temporal variation in sampling intensity (but retain spatial sampling structure), we also simulated a model version with constant sampling effort over time. When 100 individuals are randomly selected from each time slice, in the same spatial arrangement as the empirically sampled localities, the trend in species richness (gamma diversity) over time tracks the changes in global diversity (Fig. 6). The simulated patterns in diversity where sampling effort is standardized bear only limited resemblance to the real fossil record together with its sampling biases; they match the general trend only for beta diversity.

**Fig. 6: Simulated diversity where temporal biases are removed but spatial sampling structure is retained.**

Discussion

This work shows that the apparent increases in face-value diversity observed in the fossil record of early tetrapods across the late Carboniferous/early Permian can be explained by a simple mechanistic neutral model that accounts for biases in sampling. However, there does appear to be a small but observable change in the characteristics of early tetrapod diversity around 307 Ma, the approximate timing of the CRC. This can be explained by either changes in dispersal, changes in density of individual organisms (Extended Data Figs. 3 and 4) or fragmentation of habitats (which is theoretically similar to a reduction in species diversity; Fig. 3). These findings support the previous assessment that patterns of diversity in the early tetrapod fossil record should not be interpreted at face value⁶.

The model scenario of rainforest fragmentation that is most consistent with the empirical (face-value) fossil data is one in which the global density of individual early tetrapod organisms decreases by a small amount at 307 Ma (Fig. 3). When sampling the simulations in a realistic manner, this results in a temporary dip in the face-value gamma diversity and beta diversity of amphibians around the time of the CRC. By contrast, amniotes show an increase in both face-value beta and gamma diversity, suggesting a potential role of endemism, although this does not have much effect until 10 million years after the CRC. Under this scenario, simulated face-value gamma diversity losses during the CRC are even greater than those observed at face value in the fossil record (after accounting for the changes in sampling effort over time).

When many more individuals were sampled from the same simulation models, the emergent diversity patterns changed considerably because so much more of the underlying system is revealed (Fig. 5). For example, a larger sample may not uncover much more species-level diversity, suggesting that the already present species dominate with large abundances. This sensitivity to sampling suggests that the temporal changes in alpha and beta diversity found in the fossil record may disappear as more fossils are found. This shows that the effect of sampling bias can be mitigated to an extent by more intense sampling, even if the additional sampling is equally biased. When the same number of individuals are sampled from each point in time within our simulations, the trends in species richness and alpha diversity disappear to an extent (Fig. 6). This, again, suggests that the face-value patterns in the fossil record are an artefact of changes in the number of locations sampled within each time interval. The development of endemism does happen, as can be seen from increasing beta diversity following the CRC, for both amniotes and amphibians (Fig. 6). However, it is not enough to offset the alpha diversity decrease from habitat loss, suggesting that the effects of endemism often do not increase gamma diversity²² and in fact there is a small decrease in gamma diversity after the CRC, probably in response to habitat loss.

Taken together, our results suggest that endemism from habitat loss at the CRC would have probably led to a net decrease in gamma species richness, and not an increase as has been claimed previously by Sahney et al.³. After accounting for sampling bias, the limited changes to global richness are primarily driven by a modest reduction in global tetrapod population density over time, which is consistent with the expected ecological impact of the collapse of the rainforests and drying of the climate. The simulated scenario that aligns best with the empirical, face-value patterns is that of random habitat loss of between 0% and 20%, a scenario that is dynamically identical under neutral theory to an equivalent reduction in density¹⁷.

Our models used relatively abstract patterns of habitat loss, because the real patterns are not known. Future research could attempt to produce more realistic patterns of rainforest habitat loss, based on either palaeoclimate reconstructions or comprehensive occurrence data for fossil plants. Integrating more accurate maps of tropical rainforest coverage over time with the mechanistic basis of neutral theory would be more informative for exploring theories of diversity generation following the CRC. This is not currently possible because of the absence of readily available palaeoclimate reconstructions for this particular time interval, and the challenges associated with building comprehensive, spatially explicit, occurrence-based datasets for fossil plants. It is also not immediately clear how one would relate forest patterns to the dynamics of early tetrapod diversity because amphibians and reptiles (both modern and extinct) exhibit broad variability in their dependency on forest cover. One immediate pattern of rainforest loss that could be incorporated into future related work, with the addition of empirical data, is the hypothesis that the rainforest disappearance began in western Pangaea before moving eastwards²³. Another key consideration for future research is deciphering the influence of hierarchical spatial scaling on the patterns recovered here; alpha, beta and gamma diversity are ultimately nested and changes at the community scale can be reflected at larger scales^24,25.

The neutral models explored here assumed that abundances (population densities) were consistent over time (that is, the same number of individual organisms exist in each unit of habitat), except in the case of habitat loss at the CRC. However, the abundance of early tetrapods would also have a significant effect on the numbers of specimens preserved in the fossil record. Consequently, lower numbers of fossil specimens could be indicative of smaller populations and lower species richness. However, it is difficult to satisfactorily resolve the relationship between density and sampling rate because the nature of fossil preservation varies substantially over time, space and environments. Across our dataset of late Carboniferous and early Permian tetrapods, quality of preservation (and thus the size of the ‘taphonomic window’) varies substantially, which in turn influences sampling intensity (Extended Data Fig. 5). Fossil localities of late Carboniferous age that have yielded particularly well-preserved or abundant specimens are typically coal deposits^26,27 (for example, coal mines at Nýřany in the Czech Republic and Linton Diamond Mine in Ohio, United States). In the early Permian, owing to the combination of orogenic activity (mountain building) and drier climatic conditions, fossils are much less likely to be preserved in coal deposits. Instead, many richly diverse localities in the early Permian are the remains of terrestrial environments such as floodplains, river systems and even caves²⁸, many of which have been quarried and excavated extensively over many decades (for example, various localities in the Red Beds of Texas and Oklahoma, United States). This lack of coal deposits in the early Permian also reflects the contraction of rainforest habitats across this interval, invoking the common-cause hypothesis, which states that the covariation of fossil and rock records is due to an external factor^29,30. Similarly, the disappearance of coal deposits might simultaneously affect taphonomic windows and true underlying biodiversity driven by the loss of rainforest habitat. Because of these temporal changes in preservation, it is impossible to infer true densities of early tetrapods during this interval (and probably any interval in the geological past). This limitation motivated keeping density as a free parameter within the neutral simulations, but precludes understanding of how both early tetrapod densities and preservation rates varied. Unravelling the true historical changes would require a better understanding of both the true densities of early tetrapods over time and changes to the preservation rates over time (one of the measures that is possible to estimate for species within assemblages).

Our explanations of the changes in early tetrapod diversity through time have all been based on ecological neutral theory. Alternative explanations could come from changes in non-neutral dynamics, such as species niche structure, competition between species or wider ecosystem-level shifts. These explanations cannot yet be tested from a mechanistic basis but represent an exciting avenue of future research, as do investigations of the minimal requirements for these models to have stronger predictive power.

Conclusions

Statistical approaches to estimating past biodiversity patterns can provide important insights into patterns of diversity^6,22,31. However, they are generally limited by the geographical and temporal extent of the available fossil occurrence data. In our study, spatially explicit neutral models have proven to be a valuable tool for directly testing established hypotheses of diversity change in the first vertebrates to emerge onto land, and illuminating the impacts of spatial and temporal sampling biases on their face-value diversity patterns.

Interdisciplinary studies integrating modern ecological theory with palaeontological data have been identified as crucial for informing predictions for future diversity^32,33 as well as more accurately understanding past biodiversity patterns^34,35,36,37. Our results shed new light on the impact of the CRC on early tetrapod diversity, by showing that increased endemism resulting from habitat loss at the CRC is unlikely to have produced an increase in biodiversity. Our study also offers new insights into the effects of sampling bias on fossil diversity estimates, and demonstrates the huge untapped potential that mechanistic models, such as those founded on neutral theory, have for testing hypotheses of deep-time biodiversity change.

Methods

Neutral models

Assessment of spatial and temporal biases on a mechanistic basis requires a model that is spatially and temporally explicit. In addition, studying the impact of habitat loss and fragmentation on biodiversity requires a model that can directly incorporate these dynamics within the biodiversity-generating process. Neutral models fulfil all these requirements and are also tractable at large scales. Neutral theory³⁸ assumes that the properties of an individual are independent of its species identity. The dynamics of neutral models are thus dictated by some combination of dispersal, ecological drift and speciation. The output of neutral models is a simulated ecological community, where each individual has an assigned species identity. These simulated communities are equivalent to a complete census of the simulated area. The communities provide a baseline for expected biodiversity under ‘idealized’ conditions³⁹, against which the biodiversity from real communities can be compared. Neutral theory has, however, only rarely been applied in analyses of fossil data. A few palaeoecological studies have used spatially implicit neutral theory^16,20,21 where populations (for example, within separate continents) are divided to roughly represent spatial barriers. To the best of our knowledge, however, no previous study has applied a fully spatially explicit neutral model to fossil data.

The classic, spatially implicit model³⁸ conceives of a local community connected to a metacommunity by immigration at a given rate; other models incorporate more explicit dispersal between parts of the landscape. Being based on fundamental biological mechanisms, neutral theory has utility for identifying underlying dynamics⁴⁰, acting as a null or ‘ideal’ model³⁹, or making predictions at broader spatial or temporal scales than are possible with field experiments⁴¹. We use a spatially explicit neutral model⁴² that incorporates the exact locations of each individual in space and incorporates a dispersal kernel to describe the distance moved by offspring from their parents. Such a fully spatially explicit model is essential to account for spatial sampling bias. The metacommunity concept of the spatially implicit model is replaced by movement around a broad spatially explicit landscape.

The mechanism of our model proceeds as follows: an individual is first chosen to die leaving a ‘space’ that will be filled by a newborn individual. The parent of the newborn individual is chosen from other nearby individuals according to a dispersal kernel, which we modelled as a two-dimensional normal distribution. The newborn is normally conspecific to its parent, but occasionally, with probability ν at each birth, it becomes a new species. Over many generations, nearby individuals are more likely to be the same species, whereas distant individuals will be more likely to be different. We use these models to generate communities of species across the landscape.
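As an illustration only, the death, dispersal and speciation steps described above can be written as a naive forwards-time loop in Python. This is a minimal sketch and not the implementation used in this study (the simulations here rely on coalescence): the wrapped (toroidal) grid holding one individual per cell, the function names `step` and `simulate`, and all default parameter values are assumptions made for the example.

```python
import random

def step(grid, sigma, nu, next_species_id, rng):
    """Apply one death-birth event in place; return the updated species counter."""
    rows, cols = len(grid), len(grid[0])
    # An individual is chosen to die, leaving a gap for a newborn.
    r, c = rng.randrange(rows), rng.randrange(cols)
    if rng.random() < nu:
        # Speciation: the newborn founds a new species.
        grid[r][c] = next_species_id
        return next_species_id + 1
    # Otherwise the parent is located with a two-dimensional normal kernel
    # (independent Gaussian offsets along each axis, rounded to grid cells).
    while True:
        pr = (r + round(rng.gauss(0.0, sigma))) % rows  # assumed wrap-around edges
        pc = (c + round(rng.gauss(0.0, sigma))) % cols
        if (pr, pc) != (r, c):  # the dead individual cannot be its own parent
            break
    grid[r][c] = grid[pr][pc]  # the newborn is conspecific with its parent
    return next_species_id

def simulate(rows=20, cols=20, sigma=1.5, nu=0.01, events=20000, seed=1):
    """Run a toy forwards-time neutral simulation and return the species grid."""
    rng = random.Random(seed)
    grid = [[0] * cols for _ in range(rows)]  # start from a single species
    next_id = 1
    for _ in range(events):
        next_id = step(grid, sigma, nu, next_id, rng)
    return grid

if __name__ == "__main__":
    community = simulate()
    print(len({species for row in community for species in row}), "species present")
```

Even on this toy grid, every individual in the landscape has to be tracked through every event, which is the cost that backwards-time methods avoid.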

A major development for neutral theory was backwards-time coalescence methods⁴³, which produce equivalent results to a naïve (forwards-time) implementation of the mechanisms described above but are many orders of magnitude faster in computational performance. Furthermore, many scenarios are made possible with coalescence that are not possible otherwise, such as exceedingly large or infinite landscapes⁴² or sampling a small subset of individuals from the landscape without having to simulate the entire landscape first. The latter feature means that our models can simulate observations at just the precise locations observed in the fossil record, while accounting mechanistically for the whole community alive at the time with a full spatial structure from the relevant period in history. An equivalent model using forwards-time techniques would require simulating every tetrapod that existed across the entire time frame and continent of interest, a feat not remotely feasible with current computational power. Unfortunately, most non-neutral models cannot benefit from the use of coalescence and associated abilities to account for sampling in huge spatially explicit systems. We use the pycoalescence package available for Python and R¹⁷, which uses coalescence methods implemented in C++ for high-performance spatially explicit neutral simulations.

Preparation of fossil occurrence data

Data detailing the global occurrences of early tetrapod species from the late Carboniferous (Bashkirian) to early Permian (Kungurian) were downloaded from the Paleobiology Database (www.paleobiodb.org). These data represent the published knowledge on the global occurrences of early tetrapod species alongside taxonomic opinions; it is the result of a concerted effort to document the Palaeozoic terrestrial tetrapod fossil record. The dataset was cleaned by removing marine taxa, ichnotaxa and taxa with uncertain taxonomic identifications. The total number of amniote (including Reptiliomorpha (Table 1)) and amphibian (non-amniotes and early tetrapodomorphs (Table 1)) species per locality was ascertained and recorded (Extended Data Fig. 5). The resulting dataset (Supplementary information) details the number of amniote and amphibian species found at each locality (a ‘collection’ in Paleobiology Database terms) during each of the eight stratigraphic intervals from the Bashkirian to the Kungurian.

Table 1 Glossary of terms used in this study

Neutral simulations of early tetrapods

We split the tetrapods into amphibians and amniotes to reflect their differing physiologies and environmental preferences, treating each with an independent neutral model. Our simulations required maps of the relative density of individuals across the globe. These were determined separately for each interval (Bashkirian to Kungurian) from the continental boundaries of the time. Global rasterized maps of individual relative densities were produced at 0.01-degree resolution using the continental extents provided by the Paleobiology Database based on GPlates palaeogeographical reconstructions⁴⁴. This corresponds to pixels of around 1 km², each representing a cell for our model. The palaeocoordinates of each fossil locality were calculated and localities were then aggregated within each 1 km² cell. Specimen counts per locality were estimated using the ‘occurrences-squared’ heuristic⁴⁵, calculated simply as the square of the number of unique fossil occurrences. This metric provides a basic way of accounting for the fact that most localities lack information about counts of specimens, and that it is rarely obvious how many distinct individuals contributed to a set of fossil fragments. Using this metric in our models approximates the total number of individuals that contributed to the observed fossil record and therefore the number of individuals that should be sampled in the neutral simulations. This generated a ‘sample map’ defining the number of individuals to be sampled at each position in space. Because the majority of the globe was not sampled, most cells in this sample map were set to 0. The relative density and the sample maps together contain the spatial information of the entire global community of amphibians and amniotes for the simulation and define which individuals from each global community were sampled.
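The construction of a sample map from occurrence coordinates can be sketched as follows. This is a hedged illustration rather than the code used in the study: the function names `cell_index` and `build_sample_map`, the raster origin at 90° N, 180° W, and the choice to pool occurrences within a cell before squaring are all assumptions made for the example.

```python
import math
from collections import Counter

def cell_index(lat, lon, resolution=0.01):
    """Map palaeocoordinates in degrees to integer (row, col) raster indices.

    Assumes the raster starts at 90 N, 180 W (hypothetical layout).
    """
    row = math.floor((90.0 - lat) / resolution)
    col = math.floor((lon + 180.0) / resolution)
    return row, col

def build_sample_map(occurrences, resolution=0.01):
    """Return {cell: individuals_to_sample} using occurrences-squared.

    occurrences: iterable of (lat, lon) pairs, one per unique fossil occurrence.
    Cells missing from the returned dict are unsampled (value 0).
    """
    # Aggregate occurrences that fall inside the same raster cell.
    per_cell = Counter(cell_index(lat, lon, resolution) for lat, lon in occurrences)
    # Occurrences-squared: estimated specimens = (unique occurrences) ** 2.
    return {cell: n ** 2 for cell, n in per_cell.items()}

if __name__ == "__main__":
    toy = [(10.001, 20.001), (10.002, 20.003), (10.004, 20.002), (-5.505, 100.255)]
    print(build_sample_map(toy))
```

Storing only the sampled cells keeps the map sparse, which suits a global raster in which almost every cell is 0.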

The second parameter critical for the simulations is the dispersal rate (σ), which controls the distance that individuals disperse across the landscape in a given generation. σ is used as the variance in a Rayleigh distribution determining the radius of dispersal, with a separate uniform random number determining the direction of dispersal. This means that larger values of σ correspond with longer dispersal distances, on average.
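A dispersal event of this kind can be sketched in a few lines of Python. In this illustration σ is treated as the scale parameter of the Rayleigh distribution, which is an assumption about the parameterization; under that assumption the mean dispersal distance is σ√(π/2), and a Rayleigh radius combined with a uniform direction is equivalent to an isotropic two-dimensional normal kernel. The function names are hypothetical.

```python
import math
import random

def disperse(sigma, rng):
    """Return one (dx, dy) displacement with Rayleigh radius and uniform angle."""
    u = rng.random()  # uniform on [0, 1)
    # Inverse-CDF sampling of a Rayleigh variate with scale sigma (assumed).
    radius = sigma * math.sqrt(-2.0 * math.log(1.0 - u))
    theta = rng.uniform(0.0, 2.0 * math.pi)  # direction of dispersal
    return radius * math.cos(theta), radius * math.sin(theta)

def mean_distance(sigma):
    """Expected dispersal distance for a Rayleigh distribution with scale sigma."""
    return sigma * math.sqrt(math.pi / 2.0)

if __name__ == "__main__":
    rng = random.Random(0)
    draws = [math.hypot(*disperse(2.0, rng)) for _ in range(20000)]
    print(sum(draws) / len(draws), mean_distance(2.0))
```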

The eight stratigraphic intervals sampled from the fossil record were sufficiently far apart in time that we reasonably assumed no shared species between the different time intervals within the model. Consequently, we ran simulations for each time interval as separate neutral models in parallel, and aggregated the communities post-simulation.

We performed simulations with parameters encompassing a broad range of biologically feasible values: density values for habitat cells ranged from 25 to 1,000 individuals per km² for ‘habitat’ regions (non-habitat regions have a density of 0 individuals), the parameter of dispersal (σ) varied to give mean distances of 0.1–14 km, and speciation rates varied from 10⁻⁸ to 10⁻¹. We explored 5 density and 5 dispersal parameters giving 25 combinations using Latin hypercube sampling⁴⁶ to evenly sample from arithmetic parameter space. Under coalescence methods, higher speciation rates can be applied post-simulation for generating communities^17,43. We performed simulations using a minimum speciation rate of 10⁻⁸ and applied all other speciation rates afterwards to generate additional communities.
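For readers unfamiliar with Latin hypercube sampling, the idea can be sketched with the Python standard library. This is an illustration under stated assumptions and not the sampling code used in the study: the function name `latin_hypercube`, the choice of 25 points and the use of the density and mean-dispersal ranges as the two dimensions are assumptions made for the example.

```python
import random

def latin_hypercube(n, bounds, rng):
    """Draw n points so that each dimension's n equal-width strata are each used once.

    bounds: list of (low, high) pairs, one per parameter, on an arithmetic scale.
    """
    columns = []
    for low, high in bounds:
        width = (high - low) / n
        # One random value inside each stratum of this dimension.
        values = [low + (i + rng.random()) * width for i in range(n)]
        # Shuffle so strata are paired at random across dimensions.
        rng.shuffle(values)
        columns.append(values)
    return list(zip(*columns))

if __name__ == "__main__":
    rng = random.Random(42)
    # Hypothetical ranges: density (individuals per km^2), mean dispersal (km).
    for density, dispersal in latin_hypercube(25, [(25.0, 1000.0), (0.1, 14.0)], rng):
        print(round(density, 1), round(dispersal, 2))
```

Compared with independent uniform draws, this guarantees that no part of either parameter range is left unsampled.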

Three broad scenarios of tetrapod diversity were simulated (Fig. 1). In all models, the global landscape was restricted by continental boundaries. Our simplest model (scenario A) contained pristine habitat with no habitat loss (that is, uniform, with no habitat fragmentation). Two scenarios (B and C) exhibited habitat loss of different forms following the CRC. The landscape was fragmented according to a random spatial pattern, so that land areas contained habitat on a percentage of their area (either 20%, 40% or 80% of habitat remaining). The random pattern was generated by randomly removing pixels from the landscape until the desired percentage of habitat remained.
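The random fragmentation step can be sketched in a few lines. The function name and the representation of the landscape as a collection of cell identifiers are hypothetical. Keeping a uniformly random subset of the land cells is equivalent to removing cells at random until the target fraction remains.

```python
import random


def fragment_habitat(land_cells, fraction_remaining, rng=random):
    """Return the set of land cells that still contain habitat.

    Drawing the cells to keep without replacement is equivalent to
    removing randomly chosen cells until the target fraction remains.
    """
    cells = list(land_cells)
    n_keep = round(len(cells) * fraction_remaining)
    return set(rng.sample(cells, n_keep))


# A toy 100 x 100 landscape with 20% of habitat remaining.
rng = random.Random(7)
land = [(row, col) for row in range(100) for col in range(100)]
habitat = fragment_habitat(land, 0.2, rng)
```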

Model parameterization

To determine how well the simulations fit the patterns in the fossil record, four biodiversity metrics were used for each interval: the alpha diversity (α) for each fossil locality (that is, the local species richness), the mean alpha diversity across all localities, the total species richness across all localities (γ) and the mean beta diversity (calculated as \(\beta = \frac{\gamma}{\alpha}\)) across all localities. The mean actual percentage error between the real and simulated fossil records in alpha diversity for each locality was averaged to get a mean alpha accuracy μ_α. The mean actual percentage error between the real and simulated fossil records was calculated for each other metric (α, β and γ). Averaging the mean actual percentage errors for the four metrics (μ_α, α, β and γ) gives an indication of the goodness of fit for one simulation; we refer to this percentage as the accuracy of a single simulation. There is some redundancy between the values because the parameters are not independent, but the approach should still result in the simulation that most closely matches the real fossil record.
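The goodness-of-fit calculation can be illustrated with a short sketch for a single interval. It rests on several assumptions that are not confirmed by the text: the percentage error is taken to be the absolute difference relative to the real value, beta diversity is computed as γ divided by the mean alpha diversity, and real and simulated localities are assumed to be listed in the same order. All function names are hypothetical.

```python
def diversity_metrics(localities):
    """Compute alpha per locality, mean alpha, gamma and beta.

    `localities` is a list of sets of species names, one per locality.
    """
    alphas = [len(species) for species in localities]
    mean_alpha = sum(alphas) / len(alphas)
    gamma = len(set().union(*localities))  # total species richness
    beta = gamma / mean_alpha
    return alphas, mean_alpha, gamma, beta


def percentage_error(real, simulated):
    """Assumed form: absolute difference as a percentage of the real value."""
    return abs(simulated - real) / real * 100.0


def simulation_error(real_localities, simulated_localities):
    """Average the percentage errors of the four metrics for one interval."""
    r_alphas, r_mean, r_gamma, r_beta = diversity_metrics(real_localities)
    s_alphas, s_mean, s_gamma, s_beta = diversity_metrics(simulated_localities)
    # Per-locality alpha errors, averaged to give the mu_alpha term.
    per_locality = [percentage_error(r, s) for r, s in zip(r_alphas, s_alphas)]
    mu_alpha = sum(per_locality) / len(per_locality)
    errors = [
        mu_alpha,
        percentage_error(r_mean, s_mean),
        percentage_error(r_beta, s_beta),
        percentage_error(r_gamma, s_gamma),
    ]
    return sum(errors) / len(errors)


real = [{"a", "b"}, {"b", "c"}]
simulated = [{"x"}, {"x", "y"}]
error = simulation_error(real, simulated)
```

A lower value indicates a closer match between the simulated and real fossil records under these assumptions.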

Because each interval was run as a separate neutral simulation, the parameters of speciation rate, density and dispersal could be allowed to vary over time. However, because combinations of parameters can be aggregated in any number of ways, we considered just two possibilities that reflected our assumptions of the possible ecological changes over time: either there was no change in these parameters (we use a single parameter set for all intervals); or the parameters could change at the time of the CRC (we use two parameter sets, one for pre-CRC (323–307 Ma) and one for post-CRC (307–272 Ma)). The first scenario represents a neutral ecosystem with no changes in fundamental ecological dynamics. The second presents a neutral scenario that assumes ecological changes were generated by the CRC and may be reflected in neutral dynamics. In some tests a single set of parameters (speciation rate, dispersal and density) was used for all time intervals, whereas in others this requirement was relaxed to investigate how the parameters themselves may change over time.

Upscaling and downscaling simulated communities

To explore potential biodiversity patterns that would emerge if the fossil record included a larger number of individuals, we ran simulations with the same model parameters as the best-fitting simulations, including the same number of simulated individuals, but reporting back on ten times more sampled individuals (sampled with replacement). This scenario demonstrates how the emergent biodiversity patterns change with the overall intensity of sampling effort in isolation from other factors. We also explored the effect of sampling a fixed number of individuals from each time interval, for comparison with sampling different numbers of individuals from each time interval in line with the temporal changes in sampling intensity present in the fossil record. This samples from the simulation without sampling-intensity biases over time but retains the sampling-intensity biases over space matching the real-world spatial sampling pattern. It enables us to demonstrate the effect of temporal sampling biases in isolation from other factors.
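The effect of sampling intensity alone can be sketched by resampling a fixed simulated community at different intensities. The function name and the toy community are hypothetical; the example only shows the mechanics of sampling with replacement and does not reproduce the study's results.

```python
import random


def sampled_richness(community, n_individuals, rng=random):
    """Species richness observed in a sample drawn with replacement.

    `community` is a list with one species label per simulated individual.
    """
    draws = rng.choices(community, k=n_individuals)
    return len(set(draws))


# A toy community with a few common species and many rare ones.
rng = random.Random(11)
community = ["common_1"] * 500 + ["common_2"] * 300 + [
    f"rare_{i}" for i in range(200)
]
baseline = sampled_richness(community, 50, rng)
upscaled = sampled_richness(community, 500, rng)  # ten times more individuals
```

Drawing the same number of individuals for every interval, instead of interval-specific counts, would correspond to the fixed-sample comparison described above.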

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All relevant data supporting our analyses are available in the OSF repository: https://doi.org/10.17605/OSF.IO/ZGHWB.

Code availability

Code for downloading and cleaning fossil occurrence data from the Paleobiology Database is available at: https://github.com/emmadunne/neutral_theory_tetrapods. Code for running simulations using pycoalescence is available at: https://github.com/thompsonsed/palaeo_neutral_sims. Both are accessible through the OSF repository: https://doi.org/10.17605/OSF.IO/ZGHWB.

References

Uhl, D. & Cleal, C. J. Late Carboniferous vegetation change in lowland and intramontane basins in Germany. Int. J. Coal Geol. 83, 318–328 (2010).
Cleal, C. J. et al. Plant biodiversity changes in Carboniferous tropical wetlands. Earth Sci. Rev. 114, 124–155 (2012).
Sahney, S., Benton, M. J. & Falcon-Lang, H. J. Rainforest collapse triggered Carboniferous tetrapod diversification in Euramerica. Geology 38, 1079–1082 (2010).
Macarthur, R. H. & Wilson, E. O. The Theory of Island Biogeography (Princeton Univ. Press, 1967).
Brocklehurst, N., Dunne, E. M., Cashmore, D. D. & Fröbisch, J. Physical and environmental drivers of Paleozoic tetrapod dispersal across Pangaea. Nat. Commun. 9, 5216 (2018).
Dunne, E. M. et al. Diversity change during the rise of tetrapods and the impact of the ‘Carboniferous rainforest collapse’. Proc. R. Soc. B 285, 20172730 (2018).
Clack, J. A. et al. Phylogenetic and environmental context of a Tournaisian tetrapod fauna. Nat. Ecol. Evol. 1, 2 (2016).
Alroy, J. On four measures of taxonomic richness. Paleobiology 46, 158–175 (2020).
Close, R. A., Evers, S. W., Alroy, J. & Butler, R. J. How should we estimate diversity in the fossil record? Testing richness estimators using sampling-standardised discovery curves. Methods Ecol. Evol. 9, 1386–1400 (2018).
Benson, R. B. J. & Upchurch, P. Diversity trends in the establishment of terrestrial vertebrate ecosystems: interactions between spatial and temporal sampling biases. Geology 41, 43–46 (2013).
Benton, M. J., Ruta, M., Dunhill, A. M. & Sakamoto, M. The first half of tetrapod evolution, sampling proxies, and fossil record quality. Palaeogeogr. Palaeoclimatol. Palaeoecol. 372, 18–41 (2013).
Pearson, M. R., Benson, R. B. J., Upchurch, P., Fröbisch, J. & Kammerer, C. F. Reconstructing the diversity of early terrestrial herbivorous tetrapods. Palaeogeogr. Palaeoclimatol. Palaeoecol. 372, 42–49 (2013).
Brocklehurst, N., Day, M. O., Rubidge, B. S. & Fröbisch, J. Olson’s Extinction and the latitudinal biodiversity gradient of tetrapods in the Permian. Proc. R. Soc. B 284, 20170231 (2017).
Pardo, J. D., Small, B. J., Milner, A. R. & Huttenlocker, A. K. Carboniferous–Permian climate change constrained early land vertebrate radiations. Nat. Ecol. Evol. 3, 200–206 (2019).
Brocklehurst, N. Olson’s Gap or Olson’s Extinction? A Bayesian tip-dating approach to resolving stratigraphic uncertainty. Proc. R. Soc. B 287, 20200154 (2020).
Holland, S. M. Diversity and tectonics: predictions from neutral theory. Paleobiology 44, 219–236 (2018).
Thompson, S. E. D., Chisholm, R. A. & Rosindell, J. pycoalescence and rcoalescence: packages for simulating spatially explicit neutral models of biodiversity. Methods Ecol. Evol. 11, 1237–1246 (2020).
Thompson, S. E. D., Chisholm, R. A. & Rosindell, J. Characterising extinction debt following habitat fragmentation using neutral theory. Ecol. Lett. 22, 2087–2096 (2019).
Brocklehurst, N. A simulation-based examination of residual diversity estimates as a method of correcting for sampling bias. Palaeontol. Electronica 18, 1–15 (2015).
Holland, S. M. & Sclafani, J. A. Phanerozoic diversity and neutral theory. Paleobiology 41, 369–376 (2015).
Jordan, S. M. R., Barraclough, T. G. & Rosindell, J. Quantifying the effects of the break up of Pangaea on global terrestrial diversification with neutral theory. Phil. Trans. R. Soc. B 371, 20150221 (2016).
Close, R. A. et al. The apparent exponential radiation of Phanerozoic land vertebrates is an artefact of spatial sampling biases. Proc. R. Soc. B 287, 20200372 (2020).
Cleal, C. J. & Thomas, B. A. Palaeozoic tropical rainforests and their effect on global climates: is the past the key to the present? Geobiology 3, 13–31 (2005).
Sepkoski, J. J. Alpha, beta, or gamma: where does all the diversity go? Paleobiology 14, 221–234 (1988).
Patzkowsky, M. E. Origin and evolution of regional biotas: a deep-time perspective. Annu. Rev. Earth Planet. Sci. 45, 471–495 (2017).
Hook, R. W., Ferm, J. C., Whittington, H. B. & Morris, S. C. A depositional model for the Linton tetrapod assemblage (Westphalian D, Upper Carboniferous) and its palaeoenvironmental significance. Phil. Trans. R. Soc. B 311, 101–109 (1997).
Ó Gogáin, A. et al. Metamorphism as the cause of bone alteration in the Jarrow assemblage (Langsettian, Pennsylvanian) of Ireland. Palaeontology 65, e12628 (2022).
MacDougall, M. J., Tabor, N. J., Woodhead, J., Daoust, A. R. & Reisz, R. R. The unique preservational environment of the Early Permian (Cisuralian) fossiliferous cave deposits of the Richards Spur locality, Oklahoma. Palaeogeogr. Palaeoclimatol. Palaeoecol. 475, 1–11 (2017).
Dunhill, A. M., Hannisdal, B. & Benton, M. J. Disentangling rock record bias and common-cause from redundancy in the British fossil record. Nat. Commun. 5, 4818 (2014).
Peters, S. E. & Heim, N. A. Macrostratigraphy and macroevolution in marine environments: testing the common-cause hypothesis. Geol. Soc. Spec. Publ. 358, 95–104 (2011).
Mannion, P. D. et al. A temperate palaeodiversity peak in Mesozoic dinosaurs and evidence for Late Cretaceous geographical partitioning. Glob. Ecol. Biogeogr. 21, 898–908 (2012).
Benton, M. J. Origins of biodiversity. PLoS Biol. 14, e2000724 (2016).
Barnosky, A. D. et al. Merging paleobiology with conservation biology to guide the future of terrestrial ecosystems. Science 355, eaah4787 (2017).
Jackson, J. B. C. & Johnson, K. G. Measuring past biodiversity. Science 293, 2401–2404 (2001).
Willis, K. J. & Birks, H. J. B. What is natural? The need for a long-term perspective in biodiversity conservation. Science 314, 1261–1265 (2006).
Bonuso, N. Shortening the gap between modern community ecology and evolutionary paleoecology. PALAIOS 22, 455–456 (2007).
Mayhew, P. J., Jenkins, G. B. & Benton, T. G. A long-term association between global temperature and biodiversity, origination and extinction in the fossil record. Proc. R. Soc. B 275, 47–53 (2007).
Hubbell, S. P. The Unified Neutral Theory of Biodiversity and Biogeography (MPB-32) (Princeton Univ. Press, 2001).
Alonso, D., Etienne, R. S. & McKane, A. J. The merits of neutral theory. Trends Ecol. Evol. 21, 451–457 (2006).
Vergnon, R., Dulvy, N. K. & Freckleton, R. P. Niches versus neutrality: uncovering the drivers of diversity in a species-rich community. Ecol. Lett. 12, 1079–1090 (2009).
Rahbek, C. et al. Predicting continental-scale patterns of bird species richness with spatially explicit models. Proc. R. Soc. B 274, 165–174 (2006).
Rosindell, J. & Cornell, S. J. Species–area relationships from a spatially explicit neutral model in an infinite landscape. Ecol. Lett. 10, 586–595 (2007).
Rosindell, J., Wong, Y. & Etienne, R. S. A coalescence approach to spatial neutral ecology. Ecol. Inform. 3, 259–271 (2008).
Seton, M. et al. Global continental and ocean basin reconstructions since 200 Ma. Earth Sci. Rev. 113, 212–270 (2012).
Alroy, J. New methods for quantifying macroevolutionary patterns and processes. Paleobiology 26, 707–733 (2000).
McKay, M. D., Beckman, R. J. & Conover, W. J. A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics 21, 239–245 (1979).

Acknowledgements

We thank all contributors to the Paleobiology Database, in particular T. Liebrecht, R. Whatley, J. Dummasch, J. Alroy and M. Carrano. E.M.D. thanks T. Dunkley-Jones and P. Mannion for helpful comments and discussion. E.M.D., R.A.C. and R.J.B. were funded by the European Union’s Horizon 2020 research and innovation programme under grant agreement 637483 (European Research Council Starting Grant TERRA to R.J.B.), and also through a Leverhulme Research Project Grant (RPG-2019-365 to R.J.B.). R.A.C. was also funded by a Royal Society University Research Fellowship (RF\ERE\210396). S.E.D.T. was funded by the Imperial–National University of Singapore Joint PhD Scholarship. J.R. was funded by a Natural Environment Research Council fellowship (NE/L011611/1) and a Leverhulme Trust Research Fellowship (RF-2022-497). Through J.R. and S.E.D.T., this study is an output of the Georgina Mace Centre for the Living Planet at Imperial College London. All simulations were performed on high-throughput computing systems at Imperial College London. For the purpose of open access, the authors have applied a ‘Creative Commons Attribution’ (CC BY) licence to any author accepted manuscript version arising. This is Paleobiology Database official publication number 454.

Author information

These authors contributed equally: Emma M. Dunne, Samuel E. D. Thompson.
These authors jointly supervised this work: Richard J. Butler, James Rosindell, Roger A. Close.

Authors and Affiliations

GeoZentrum Nordbayern, Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany
Emma M. Dunne
School of Geography, Earth and Environmental Sciences, University of Birmingham, Birmingham, UK
Emma M. Dunne & Richard J. Butler
Department of Life Sciences, Imperial College London, Ascot, UK
Samuel E. D. Thompson & James Rosindell
Department of Biological Sciences, National University of Singapore, Singapore, Singapore
Samuel E. D. Thompson
Department of Earth Sciences, University of Oxford, Oxford, UK
Roger A. Close

Authors

Emma M. Dunne
Samuel E. D. Thompson
Richard J. Butler
James Rosindell
Roger A. Close

Contributions

R.A.C. and J.R. conceived the project and all authors input into the design. E.M.D. and S.E.D.T. curated the data, conducted the analyses, prepared the figures and led the writing of the paper. All authors contributed to collating material for the supplementary information, and to the writing and approval of the final paper.

Corresponding author

Correspondence to Emma M. Dunne.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Ecology & Evolution thanks Lauren Sallan and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Simulated tetrapod diversity patterns over time compared against the fossil record, including habitat loss.

Simulated tetrapod diversity patterns over time compared against the fossil record (that is face-value, unstandardized counts of species). Predictions of tetrapod biodiversity patterns are produced by a neutral model parameterised with 80% habitat remaining (20% loss) and then simulated with different levels of remaining habitat (that is 100%, 40% and 20% habitat remaining). The shaded areas surrounding the dashed lines represent the variation in the five best fitting simulations. The dashed vertical line at 307 Ma indicates the timing of the CRC. For definitions of diversity measures see Table 1. The following abbreviations are used for intervals: ‘Ba’ = Bashkirian, ‘Mo’ = Moscovian, ‘Ks’ = Kasimovian, ‘Gz’ = Gzhelian, ‘As’ = Asselian, ‘Sa’ = Sakmarian, ‘Ar’ = Artinskian and ‘Ku’ = Kungurian.

Extended Data Fig. 2 Biodiversity metrics through time using uncorrected fossil data.

Raw data from the fossil record, indicating biodiversity metrics (alpha diversity, beta diversity and total species richness) and the number of individuals (that is fossils) and collections over time. Interval abbreviations are as in Extended Data Fig. 1.

Extended Data Fig. 3 Predictions of diversity from neutral model parameterised on Carboniferous diversity only.

Simulated tetrapod diversity patterns over time compared against the fossil record (that is face-value, unstandardized counts of species). Predictions of tetrapod diversity from a neutral model parameterised solely on Carboniferous diversity. Three metrics of biodiversity (alpha, beta, and gamma diversity; Table 1) are shown for both amphibians and amniotes from the Bashkirian to Kungurian from empirical data (solid black lines) and from simulated communities (dashed lines). The shaded areas surrounding the dashed lines represent the variation in the five best fitting simulations. The dashed vertical line at 307 Ma indicates the timing of the CRC. For definitions of diversity measures see Table 1. Interval abbreviations are as in Extended Data Fig. 1.

Extended Data Fig. 4 Predictions of diversity from neutral model parameterised on both Carboniferous and Permian diversity.

Simulated tetrapod diversity patterns over time compared against the fossil record (that is face-value, unstandardized counts of species). Predictions of tetrapod biodiversity patterns are produced by a neutral model parameterised separately for the late Carboniferous (pre-307 Ma) and Permian (post-307 Ma). The shaded areas surrounding the dashed lines represent the variation in the five best fitting simulations. The dashed vertical line at 307 Ma indicates the timing of the CRC. For definitions of diversity measures see Table 1 (main text). Interval abbreviations are as in Extended Data Fig. 1.

Extended Data Fig. 5 Palaeogeographical maps of fossil localities in each study interval.

Global palaeogeographical maps showing the localities of fossil sites in each stage of the late Carboniferous and early Permian. The size and colour of each circle corresponds to the number of species found at each site. Continental configurations are provided by GPlates via the chronosphere R package.

Supplementary information

Supplementary Information

Supplementary Figs. 1–5.

Reporting Summary

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Dunne, E.M., Thompson, S.E.D., Butler, R.J. et al. Mechanistic neutral models show that sampling biases drive the apparent explosion of early tetrapod diversity. Nat Ecol Evol 7, 1480–1489 (2023). https://doi.org/10.1038/s41559-023-02128-3

Received: 01 March 2022
Accepted: 20 June 2023
Published: 27 July 2023
Issue Date: September 2023
DOI: https://doi.org/10.1038/s41559-023-02128-3