Introduction

Lead (Pb), a toxic, non-essential metal for life, has been an important commodity in our technological development1 and one of the most persistent anthropogenic pollutants through time2,3. Lead may be released directly from the mining and smelting of Pb-containing ores, and more recently, through the extensive use of Pb-containing fuels, batteries, paints and other commodities4. Lead is among the best-studied metal pollutants, due in part to its toxicity, even at low levels, to multiple organ systems5, and its particular danger to young children and developing foetuses6. As such, understanding the past interplay between natural and anthropogenic sources, sinks and cycles of Pb is vital, as evidence suggests humans have greatly impacted its natural biogeochemical cycle7,8. Records of pollution indicate anthropogenic influences as far back as the Bronze Age3,9,10. Particularly large amounts of Pb were released into the atmosphere during the Roman period in Europe, with the geochemical imprint of Pb pollution especially quantifiable in peat records7,9,11,10,,12. Evidence of such activity may even be observed in Greenland ice, clear indication of the persistence of Pb pollution even in such pristine environments13,14. Further anthropogenic atmospheric Pb enrichment has been observed in records located close to centres of mining in Medieval times9,15,16, and upon the introduction of Pb-containing gasoline in the early 20th century, virtually across the globe10,17,18,19,20, even as far from mining centres as Antarctica21.

Knowledge of the pollution activity at the sites of mining and smelting, and assessment of contamination levels at different sites is not always sufficient for firm conclusions regarding Pb provenance and contributions from changing sources or contributors through time. In an effort to clarify such questions, Pb isotopes have been long used as ‘fingerprints’ of the source of Pb recorded in environmental archives3,4,12. Within the natural environment, Pb is mainly present as four stable isotopes: 208Pb (≈52%), 206Pb (≈24%), 207Pb (≈23%) and 204Pb (≈1%)4. 204Pb represents the only Pb isotope not a product of radioactive decay chains, with 206Pb, 207Pb and 208Pb originating in the decay chains of 238U, 235U and 232Th, respectively4,22. Thus, the ratios of different Pb isotopes depend on the concentrations of U, Th and Pb in various source materials, the length of the half-lives of the parent isotopes23 and the amount of time that passed. In environmental science, Pb isotopic data are commonly presented as ratios, with 206Pb/207Pb, 208Pb/206Pb typically used24. Analyses of ore bodies have indicated the variability of Pb isotope ratios depends on the type of ore, age, and style of mineralization25. A simple example is the difference observable between the 206Pb/207Pb ratio of very old ores such as the Broken Hill ore (formed 1.6–1.8 billion years ago, and recently used in European leaded petrol), with 206Pb/207Pb = 1.05–1.126,27,28 whilst younger ores (most Jurassic to Miocene ores) fall within the range 206Pb/207Pb = 1.18–1.2529. This is because when ores form, their Pb isotope ratios reflect their crustal source, a reservoir whose Pb isotopic ratios increase slowly with time due to the radioactive decay of U and Th. After ore formation, no radiogenic growth occurs because the ores contain little or no U or Th. Thus, the Pb isotopic ratios remain identical to the crustal composition at time of ore formation30,31.

Crucially for archaeometallurgical inferences, the Pb isotopic ratios are not severely altered by physical or chemical fractionation processes during ore processing, smelting, and casting, although some minor variations may occur32. Therefore it can be assumed the Pb isotope values of the pollution-related Pb are very similar to those of the parent ore bodies33. As such, precise isotopic ratios may be used to ‘link’ Pb deposited in a substrate (e.g. bogs, lakes, ice) to a certain ore4,33. It has been demonstrated that records of Pb concentration from sedimentary archives such as peat bogs, lake sediments and ice cores may retain the isotopic signature of the original source of Pb, since once deposited within such environments, Pb is largely immobile34. Such work has clearly indicated the impact of metal pollution from both local sources35, and long range atmospheric transport9,10,14 through time. However, the signal recorded within a bog or lake represents a mixture of many local, regional and extraregional inputs, including contribution from natural sources (dust, biomass burning, volcanoes, etc) and disentangling these has proven challenging36.

Moreover, without a modelling component in interpreting Pb records from sedimentary archives, typical approaches may only be used to indicate likely sources, rather than infer quantitative apportionments. Traditionally, interpretation of Pb isotopic ratios is performed via a 3-isotope, or 4-isotope plots (see Fig. 1a), and comparing the location of the sink mixture data (the isotope signal recorded within the archive) to the isotopic signatures of known sources. Attempts have been made to model this sink mixture as a way of quantitatively assessing the relationships between the Pb recorded within a sediment and its source37,38. Until now however, Pb isotope mixing models have generally been limited to considering either two37,39,40 or three38 potential sources. This approach is adequate in a very well determined system, where only a few possible sources may be considered, or only a couple of broad sources are of interest37,39. This approach allows identification of very general source inputs, such as anthropogenic vs lithogenic Pb. Other studies have been able to distinguish petrol-derived Pb41, or local ore-derived Pb from other anthropogenic sources42, but the lack of further source consideration precludes defined provenance of such Pb to an ore body or mining region in a complex mixture37. This is due to the limitations related to the models and the Pb isotope systems themselves, where generally only two or three isotope ratios (n) are interpreted, and thus limiting such models to n + 1 sources. These types of models, therefore, may be useful in a general overview, but the complexity of potential Pb inputs, particularly in an anthropogenically polluted world, cause such approaches to be likely oversimplifications, with only limited conclusions possible37.

Figure 1
figure 1

Theoretical examples. Figure displays input (a) in the form of convex hulls; three panels, each displaying a three-isotope plot with the corresponding sources and mixtures to be modelled. Dashed black lines have been added to outline the mixing envelope, as defined by the mean values for each source (black circles). Also displayed are mixtures which fall within the envelope blue crosses) and one which does not (pink circles). Panel (b) displays the output from mixing envelope modelling, and the probability of each sample falling within the mixing envelope (%). (c) Is an example posterior density graph as output by MixSIAR, here from the model of mixture 2. Each field corresponds to the probability of a given source to provide the Pb present in the mixture. (d) Is the overall output of all four mixtures modelled in this example. In each case, the rectangle indicates the range of outputs, with the upper and lower bounds signifying the 2.5% and 97.5% credible intervals. (e) Displays convex hulls of theoretical example 2. Black circles denote the sources used to model the mixture (blue cross).

In the field of biology, mixing models have long been used to apportion large numbers of prey sources within, for example, a predator’s diet43,44,45, from stable isotope compositions using mixing models such as IsoSource46 and MixSIAR47,48. When IsoSource was developed, an initial attempt to apply it to pollution sourcing via Pb isotopes was made46. However, to our knowledge, little further work to further apply this method to the field has been done49, likely due to diffuse and poorly constrained outcome sources. Other approaches to Pb isotope modelling have been applied, with Euclidean distance calculation used to determine potential source(s) for archaeological artefacts50,51. This method, however, simply indicates the most likely ore fields (or specific mines) within a specific database.

Application of more complex models to Pb isotope tracing, therefore, could provide a novel approach to source apportionment, with potentially a better understanding of past pollution input than might be achieved solely through 3-isotope graphing or Euclidean distance modelling. In practice, this may be able to produce more reliable tracing of Pb in the environment, because far more potential sources can be considered, and less simplification of the system is necessary. This approach could be applied to the interpretation of indirect records (such as from peat, lake sediments, ice) of historical pollution, modern pollution, and even the origin of metal used in archaeological artefacts.

Here we propose a new approach, using a number of real-world examples. First we outline the principles, using a pair of theoretical models. We then apply the method to pre-anthropogenic samples from a peat bog in North-western Spain52 to track the changing influence of Saharan dust on the Pb isotope ratio. A second real-world example utilises data from the same site but from a period of heavy pollution (namely the Roman period) to trace likely ore sources of Pb deposited within the bog. Finally, we exploit the ability of the model to consider a very large number of sources, and investigate the origin of Pb in a litharge roll discovered within ancient mining works in Rosia Montana mine, Romania, and dating back to the Roman period53.

Materials and Methods

Mixing Models

As the focus of the study is on the application of complex mixing models to Pb isotopes from bulk sediment samples and artefacts, we do not outline the theory behind simple binary, or ternary mixing approaches30,37,38. For disentangling multiple sources we considered two possible approaches: 1) IsoSource, a mass balance-based multi-source mixing model46 and 2) MixSIAR47, the latest iteration of a series of Bayesian mixing models48,54.

IsoSource examines all possible source contribution configurations, at user-defined intervals (e.g. 1%). These calculations rely upon the following equations, expressed below for a configuration with one isotope system and three sources:

$${X}_{M}={f}_{A}{X}_{A}+{f}_{B}{X}_{B}+{f}_{C}{X}_{C}$$
(1)
$$1={f}_{A}+{f}_{B}+{f}_{C}$$
(2)

where:

X = Isotopic signature of the mixture (M) or sources (A,B,C).

f A , f B , f C  = Proportion of Pb from each source.

X A , X B , X C  = Source isotopic signatures.

This is an underdetermined system containing three unknowns and two equations, with no unique solution46. However, the predicted mixture signature for each source combination can be compared to the observed mixture signature; if it matches, then mass balance is achieved and that source combination is a feasible solution. Because combinations can only be examined in discrete steps, a small tolerance for deviation from exact matches is allowed46. These equations make interpretation of datasets with n isotope systems and more than n + 1 sources possible45,46,49. Outputs are generally presented as range values, to ensure that all potential combinations are reported45.

IsoSource may provide valuable model output, and prior to development of Bayesian isotopic mixing models, it was the main approach for food web studies55. However, IsoSource uses only mean isotopic signatures for sources and mixtures and does not directly account for variability in source and mixture ratios, including sampling and measurement errors. Because ore bodies are typically heterogeneous, ascribing a single value is a simplification. Complex models have recently been developed which do incorporate these uncertainties within the framework of rigorous Bayesian statistics, utilising the same basic principles as outlined in equations 1 and 248. Put simply, this approach allows the user to incorporate prior information about the system, including analytical error and source heterogeneity, and using a natural distribution (i.e., Dirichlet distribution56) allows the model to develop solutions, in terms of percentage contributions which sum to unity. The Dirichlet-defined distribution is intentionally vague, allowing for the model to be driven primarily by the data input by the user. Model fits are developed via many Markov Chain Monte Carlo (MCMC) simulations, which produce simulations of plausible source proportions based upon the data, and probability densities. Any mixtures not probabilistically consistent with the data are discarded, and mixtures developed at the end of the run are required to be close to the early ones in terms of proportion, causing a Markov chain to be converged48. These, when combined with the prior information (e.g. results of previous studies if available) produce posterior distributions which are true probability distributions for each source48.

Models built around a Bayesian approach include MixSIAR47, which is a combination of MixSIR54 and SIAR48, as well as FRUITS57, IsotopeR58, and others, which are slightly varied approaches to the same problem. Here we propose an approach for applying a state-of-the-art Bayesian stable isotope mixing model (MixSIAR) to pollution sourcing and artefact tracing via Pb isotopes. “MixSIAR” is a package which runs in the R framework. Details on downloading and setting up the package may be found at https://cran.r-project.org/web/packages/MixSIAR/index.html.

Outline of the approach, with two theoretical applications

Pre-Modelling Preparation

Prior to attempting data modelling, the user must ensure that all potential sources (mining regions active or probably active during the period of study, active smelters, potential natural Pb sources) that may have contributed to the Pb isotope signal at the study site are considered in the analysis. It should be noted that for pollution or archaeological applications, the approach is likely complicated by the fact that potential sources will have changed over time, or that ores once actively mined may have been exhausted. When developing source lists, all potential sources (with sufficient data available) must be considered, regardless of locality, due to the reality of long-range transport of Pb14,59. Further, for pollution tracing of sediment samples, a local ‘natural’ Pb isotope signal should be introduced to the model, to ensure any such local input (e.g. erosion products) of Pb is considered.

All data must be collated, and a database of all signatures prepared. As previously suggested, Pb isotope ratios relative to 204Pb should be used (i.e. 208Pb/204Pb, 207Pb/204Pb and 206Pb/204Pb). This allows for as much of the possible variability in the isotopic signature to be considered during the modelling, and because each isotope originates from a different decay chain23, a degree of independence between the three ratios. If several analyses are available for the same site, these data should be averaged before standard deviations are calculated. This allows for errors related to either measurement uncertainty or source heterogeneity to be incorporated into the model.

Because MixSIAR uses standard deviations and means to develop models, multivariate normal distributions of the isotopic compositions of ores are assumed. Previous work appears to suggest this is not necessarily the case60,61. This is typically ascribed to the complex origins of most ore bodies, in some cases with multiple depositional phases of Pb or polymetallic (Pb containing) ores at the same site common, resulting in multimodal distributions and a degree of kurtosis. However, it has been argued such multimodal distributions may, when enough data are presented, still be representative of multivariate normal (MVN), as multiple depositional events may even one another out, averaging the data62. Discussion of this point may be disputable as all such studies investigated the behaviour of the 3-dimensional mixing space developed between the ratios 206Pb/204Pb, 207Pb/206Pb and 208Pb/206Pb, and we encourage modelling of the envelope of isotopes normalised to 204Pb (208Pb/204Pb, 207Pb/204Pb and 206Pb/204Pb).

To determine the distribution of the ore bodies considered here, and to ensure MiXSIAR is an appropriate approach, we have performed a number of tests investigating the nature of their isotopic distributions. First, using univariate Shapiro-Wilk and Kolmogorov-Smirnov tests, we determine the normality of each isotope ratio in each field individually, prior to investigation of MVN of the three-ratio system via further tests (Mardia, Henze-Zirkler and Royston). All univariate analysis was performed on SPSS 22, and multivariate analysis using the package “MVN” in R63.

Results (see Table 1) indicate normality is typical for the distributions of 206Pb/204Pb (78% of ore bodies), 207Pb/204Pb (82%) and 208Pb/204Pb (69%). MVN examinations (Table 2) indicate most ore bodies may be considered MVN via at least one of the tests (78%), with smaller databases (e.g. Apuseni Porphyry, Mazarron and Oberlausitz) more likely to be MVN, but with larger databases also displaying MVN (e.g. Almeria, Gaul and Valsugana VMS). Notably, some complex and heterogeneous ore bodies (e.g. Apuseni Epithermal), are not MVN, indicating multiple mineralization phases, as observed at Rosia Montana, part of the Apuseni Epithermal ore body53, do not even out the isotope distribution range. The MVN distribution of a majority of ore bodies, and the uncertain nature of a number more appear to indicate an approach, which assumes MVN may be applicable. Crucially, the outcomes from these tests are inconclusive, neither proving nor disproving categorically an MVN nature of lead isotopes. As a result, an approach, which applies means and SDs appears the simplest and most appropriate target for such analyses.

Table 1 Univariate normality tests for all ore bodies used to construct models in this work.
Table 2 Multivariate normality (MVN) tests results for each ore body mentioned in this work.

Consequently, grouping, one of the most important parts of this analysis, should be approached robustly and can be done a priori, or a posteriori64. A priori grouping should be carried out to align sources (prior to the analysis), which are very similar in isotopic signature, or are all from the same ore field, but ensuring any MVN is not lost. Significance tests65 may be performed, to ensure that the groups are significantly different. If not, they should be grouped together.

Pre-Run Checks

Prior to running the model, checks must be performed to ensure the data for the unknown mixture fall within the mixing envelope46 as determined via the sources identified. Due to the nature of MixSIAR47 results are produced even when these are not realistic. The mixing envelope can be visualized in each 2-dimensional cross-section of the isotope space (see Fig. 1a). To be explained as simple mixtures of these sources, mixtures must fall within the space defined by the various sources. If they do not, then it must be considered that there is an issue in the working hypotheses: either errors during measurement, or a significant source of Pb has been overlooked in the analysis46. Because Bayesian modelling incorporates uncertainties (in this case means and standard deviations), a further Bayesian modelling tool may be used to ascertain exactly which samples fall within the mixing polygon66. From this further model, any samples which fall within the polygon in <5% of iterations should be discarded. Further, if a significant number are not constrained within the polygon, this is an indication the model being applied is not correct and should be reconsidered. An instance of this issue may be observed from the example in Fig. 1a, where one sample (shown by a red cross) falls outside the designated potential sources. Model results confirm this (Fig. 1b), indicating there is 0% chance of the sample falling within the polygon, and therefore it is not interpreted.

Post-Run Checks

Once all the preparation steps have been completed, the model employs during its run Markov Chain Monte Carlo (MCMC) statistics, which investigate probability distributions for variables (in this example the proportion of Pb originating from each source). From these ‘posterior’ distributions (see Fig. 1c), descriptive statistics including mean, standard deviation and range may be produced. It must be noted such outputs are only indicative of proportion of Pb from each source.

To ensure meaningful results, the MCMC chains must be long enough for convergence to be achieved. Convergence of the data may be checked via two diagnostics post-run: Gelman-Rubin and Gewecke tests47,65. The Gelman-Rubin test requires at least 1 chain, and values will be near 1 at convergence, therefore high values indicate non-convergence. The Gewecke test compares the two halves of the MCMC chain and develops z-scores to determine convergence. High z-scores are an immediate indicator of non-convergence, and so a longer run must be performed47.

In practice, with many sources and three isotope ratios, MixSIAR generally must be set to either ‘long’ or ‘very long’ MCMC chains to achieve satisfactory convergence. However, this may vary, and so it is a valuable exercise checking the post-run diagnostics, and in running the model at various lengths. For all models discussed here, ‘long’ or ‘very long’ MCMC chains have been run.

A Posteriori Grouping

When only a small number of well-constrained sources are considered, valuable results may be derived from the raw model outputs (Fig. 1d). However, Pb isotope modelling generally considers many sources, and often the output from a model run could be insufficiently well-defined, sometimes with few clear major contributors. In the second theoretical example (Fig. 1e, Table 3) this is true, with most sources contributing less than 5%, and those with high contributions showing large ranges (e.g. Source 8, Table 3). At this point, we suggest further grouping a posteriori65. This combines raw contribution results in order to produce better-constrained relative contributions, but at the expense of specificity within sources. In practice, this means a user will not be able to discern between ore bodies but will be able to indicate more reliably which groups of ore bodies (or major mining regions) are major contributors. As with a priori grouping, this should be performed on groups which are isotopically similar or are located close to each other, allowing for meaningful conclusions to be drawn, despite the simplification this approach causes.

Table 3 Model outputs from theoretical example 2, displaying the raw MixSIAR output.

To display the value of such an approach we have performed a posteriori analysis, combining sources which are isotopically similar, for minimising the number of possible end-members (see Table 3). For each possible mixture, all sources to be grouped should be summed for each individual solution64. In this way, a new distribution is created for the proportional contributions of the combined sources. The combination of sources 2, 11, 12, 13 and 15 in our theoretical example displays this (Table 4), with the contribution of this grouped source now observable, where before either the sources did not appear to contribute much to the mixture (e.g. Source 3 in Table 3) or displayed large possible ranges (e.g. Source 8 in Table 3). This approach has its advantages in clarifying broad source areas but it does remove some of the certainty to the approach requiring grouping which at times erases distinct sources. Nevertheless, without grouping, it may be difficult to say that any sources provided meaningful contributions to the Pb isotope signature, and so a posteriori grouping is a valuable tool when interpreting data which initially appears unconstrained.

Table 4 Model output from theoretical example 2, indicating output after grouping a posteriori.

This data, once produced, should be presented either as it is here, in table form (e.g. Table 3) and via figures (e.g. Fig. 1d), but always outlining the entire range of results65, and not simply the mean.

Availability of materials and data

All lead isotope data used in this study were published previously and are summarized in tables in Supplementary Information. The model results generated during the current study are available from the corresponding author on request. The tool to allow for Bayesian modelling of a mixing envelope is an R code and can be downloaded from http://www.famer.unsw.edu.au/software/polygon.html. The MixSIAR model is an R package, which may be downloaded from https://cran.r-project.org/web/packages/MixSIAR/index.html.

Results and Discussion

Example 1: Pre-Anthropogenic Dust Sourcing

Due to the prevalence of anthropogenic Pb in the biogeochemical system since the inception of metallurgy, Pb isotopes are an imperfect tracer of natural fluxes during the recent past67. However, because of the ability of Pb isotopes to record source signatures, they have been applied to dust tracking studies68. Using a selection of natural geogenic sources, we attempt to indicate the source of pre-human Pb in the geochemical cycle within Penido Vello (PVO, Fig. 2), a peat bog in north-west Spain52, for the period 5000–1300 BCE.

Figure 2
figure 2

Map of western Europe displaying mining regions used to model contributions to Penido Vello bog (blue circle).

Using sources determined in the original publication (local rock, local soil52 and a Saharan dust value69, with the authors assuming negligible anthropogenic input), we have modelled the mixing envelope, indicating only a small number of samples fall inside (Fig. 3a,b). Moreover, a large number of samples (76%) fall a long way from the envelope, due to at least two unconsidered sources (Fig. 3a). As such, by estimating what the composition of these sources must be, and then comparing with published data from plausible sources, we determine two further inputs to the PVO site. The first of these, as speculated by the original publication, appears a volcanic input from the Azores, whilst the second is indicative of a loess-derived input. For demonstration purposes we use a value from a loess field in northern France to represent this source70, and data from a selection of lava and ash analysed in the Azores71,72,73. Modelling of the mixing space now indicates the majority of mixtures fall within (Fig. 3a,c), and so we have proceeded to model the relative inputs of the five sources. Since only five potential sources are considered, no grouping has been performed a priori (SI Table 1).

Figure 3
figure 3

Convex hulls (a) from pre-anthropogenic dust tracing. Black circles denote sources used to reconstruct valid substrate mixtures from Penido Vello between 5000–1200 BCE52. Mixing envelopes as defined by the original publication are marked with solid black lines, whilst additional sources as suggested here result in mixing envelopes denoted by black dashed lines. Samples which fall within the convex hulls are marked in blue whilst those which fall outside one or more are pink, and have not been modelled. (b) Displays the output from the envelope mixing model using the original publications, (c) with additional sources. (d) Is the model output from the valid mixtures, displaying changing source contributions to the Pb contained within the mixture. In each case, the rectangle indicates the range of model outputs, with the upper and lower bounds signifying the 2.5% and 97.5% credible intervals, while black lines trace the mean value.

Using this new five-source model, a number of conclusions may be drawn. Samples appear to show periodic influx of Saharan dust into the site, with two main periods of higher (>15%) Saharan Pb proportions (3300–2800 BCE and 2100–1300 BCE) corresponding well with the original interpretation, where simple distance-based modelling indicated clear Saharan input midway (3300–2800 BCE)52. Outside of these time periods, the Pb isotope signal appears to be related to geogenic weathering, with local rock and soil the major Pb contributors throughout (Fig. 3d). A decreasing trend throughout the whole period for local rock (Fig. 3d) and increasing local soil proportions (Fig. 4c) appears indicative of local soil erosion related to farming and agriculture, with an increase in intensity throughout the time period, echoing evidence from pollen records74. The ability of our model to indicate a volcanic influence on the site is clear, with regular pulses in Azore-derived Pb (Fig. 3d).

Figure 4
figure 4

Convex hulls (ac) from Penido Vello bog during the Roman period. As in a, black circles denote sources, with each labelled in panel 5e. Panel 5d displays the output from mixing envelope modelling, indicating all samples fall within the mixing space, and so there are no red crosses on panels 4a–c. 4 f is the model output from Penido Vello between 600 BCE and 550 CE. Red rectangles correspond to a posteriori grouping of all anthropogenic sources, while yellow rectangles are the modelled contribution for Pb from natural sources. Rectangles indicate upper and lower bounds as in Fig. 3, with black lines indicating the mean. Also shown (blue line) are the raw reported Pb concentrations for the core throughout this period52.

Example 2: Pollution Sourcing from Environmental Archives

The Roman period in Iberia was characterized by intense exploitation of natural resources. To disentangle the complex signals pertaining to possible anthropogenic sources of Pb to PVO during this period, we use a selection of potential sources (See SI Table 2), including a natural pre-pollution aerosol value, as determined by the authors52. Here, no a priori grouping was performed prior to modelling. Again, the location of the samples in the mixing envelope has been indicated graphically (Fig. 4a–c), and via a model (Fig. 4d), confirming all samples fall within the mixing envelope.

The first main point of interest is the changing contribution of natural (as represented by pre-pollution aerosol) and anthropogenic (the remainder) sources (Fig. 4f). Early in the period, the anthropogenic sources (all ore bodies) make up ~60% of the Pb, prior to increasing values over the period 350–200 BCE, before plateauing at roughly 90% between 100 BCE–150 CE (Fig. 4f). When compared to conclusions from the original publication, our data suggests a much higher proportion of Pb from anthropogenic sources, perhaps indicative of a previous underestimation of metal production. These values reflect the development and decline of the Roman metal industry, from the acquisition of Hispania during the Punic wars by 146 BCE75, through their exploitation, to the combined pressure of source exhaustion2 and repeated invasions of Hispania from 170 CE onwards76. The coherence of the modelled record with the raw Pb concentration data (Fig. 4f) indicates that much of the Pb deposited during this time period was anthropogenic, indicating values of ~90% for the anthropogenic fraction seem sensible.

Unfortunately, due to overlapping fields, many of the modelled ore-related proportion distributions are unconstrained. Some conclusions are still possible, with contributions from Almeria, Andalusia, Mazarron and Cartagena (after a posteriori grouping as south-west Spain), Huelva and the Leonese Zone peaking after the Punic wars (roughly 200 BCE), and falling away by 200 CE (Fig. 5), reflecting the developmental trend indicated by overall anthropogenic proportion changes (Fig. 5). Reocin (of the Basque-Cantabrian basin in Northern Spain) appears to peak just slightly before the Roman invasion (275 BCE), suggesting exploitation by the Iron Age Iberian cultures. Further exploitation of Reocin through the remainder of the period corroborates previous indications of mineral resource exploitation in the region during the Roman period77,78, evidence for which has been destroyed by more recent utilisation. Such conclusions indicate the value of such a modelling approach, allowing the user to interpret provenance in much finer detail, tracing pollution back to exact ore fields.

Figure 5
figure 5

Model output from Penido Vello between 600 BCE and 550 CE, displaying the contribution of selected mining regions to the Pb mixture recorded in the sediments through this period. Again, rectangles indicate upper and lower bounds as in Fig. 3, with black lines indicating the mean.

Interestingly, and previously unconsidered by the original publication, mining outside of Spain appears to have also had an impact on the isotopic mixture recorded here, with the contribution of the Mendip hills and Shropshire (grouped as England) appearing to change from very low contribution values at the start of the period, gradually rising after 10 CE, peaking by 150 CE and dropping off after 250 CE (Fig. 5). Such a profile reflects the well-constrained history of British lead mining, with evidence of mining in the Mendips from c.49 CE79, prior to considerable fluctuations in lead production throughout the period of Roman rule80.

Example 3: Modelling of Artefact Provenance

Another potential application of MixSIAR and isotope mixing is the interpretation of the Pb isotope signal as recorded in individual metal artefacts. Determining the provenance of metal artefacts is of crucial importance in archaeology, and Pb isotopes have previously been used to this end in several studies50,81,82. Here we apply MixSIAR, in much the same way as done above, to the Pb isotope mixture of a litharge roll recovered from a Roman-age mining site in Romania53. To determine source contributions quantitatively, a dataset of potential sources has been assembled (SI Table 3, Fig. 6).

Figure 6
figure 6

(a) Map of eastern Europe outlining the location of the Romanian artefact (Litharge roll, black circle). Also displayed are all mining regions modelled to understand the provenance of the Pb within the artefact. (b) displays convex hulls for modelling the litharge roll, Romania. As before, black circles reflect the sources used to reconstruct possible sources of Pb, whilst the green cross indicates the isotope composition of the modelled artefact.

Due to the large number of sources, initial results indicate a wide range of potential origins for the Pb, but with Moldova Nouă ore field apparently the clearest single source (Table 5). To interpret the data more clearly, a posteriori grouping has been performed. Here, sources have generally been grouped by proximity (e.g. Baia Mare fields, and German fields), with others grouped by mineralisation and similar isotopic signature. An example of this is the inclusion of Apuseni Volcanogenic Massive Sulphides with other ores from the same mineralisation belt; the Banat Metallogenic Province83 such as Moldova Nouă and Majdanpek (Table 6, Fig. 6).

Table 5 Raw MixSIAR results for the litharge roll.
Table 6 A posteriori grouping of raw MixSIAR results as displayed in Table 5, from the Romanian litharge roll.

After grouping sources as listed in Table 6, it can be stated that most Pb present in the artefact comes from the Banatites (average of nearly 50%) whilst Baia Mare Pb appears to constitute much of the remainder. Interestingly, the Apuseni group, which contains the Rosia Montana deposit where the litharge roll was discovered, does not contribute significantly (Table 6). This is very much in line with the conclusions reached in the original publication53, who suggested that the Pb present in the litharge roll was not from the Apuseni Mountains. Our model appears to suggest a Banat source as dominant, and within that, Moldova Nouă as the most likely mining field. Moldova Nouă, and its porphyry copper deposits84 has been exploited since Roman times, when Au and Ag were extracted from base metal ores85, with the typical method exploiting the chemical properties of litharge to bring the noble metals into the lead prior to further separation53,86.

It must be mentioned here that artefacts are much more likely to be composed of a one (or more) Pb ore source(s), in this case likely the Moldova Nouă ore field. The nature of the model, however is that it considers contributions from all sources to satisfy the mass balance, and therefore all sources may have a contribution, even when, as it appears here, the actual source is one ore field. As such, we suggest caution when interpreting such outputs. When one field dominates as it does here, we suggest that the user interprets this as a single, dominant source, and does not over-interpret other contributions. A posteriori grouping may be helpful at this point, as it is in this example, where the combination with other Banatitic ores leads to the clear domination of this signal (Table 6).

Period pollution

One of the further advantages of MixSIAR is that it was developed to investigate populations of species, and not just individuals. As such it allows the user to analyse more than one mixture at a time and provide overall results for the dataset. In practice, this can allow a user not just to obtain proportions at discrete points in time, but to get an overall indication for an entire period. In the example used here, all mixtures have been run separately, and increases and decreases in certain sources may be observed. When all mixtures are run together, a much broader picture of pollution over a longer period may be produced. This is a faster process than modelling each individual mixture, and therefore may be useful for getting a better understanding of the dataset, or for comparing a few datasets covering a short period.

Conclusions

Bayesian mixing models, as developed for food-web studies, may be applied to work focussing on understanding sources of pollution as documented by Pb isotope analyses. In a system where many sources are likely, they are a much more realistic approximation of the real world than the simple binary or ternary mixing currently utilised for such studies. Furthermore, they allow for incorporation of errors, and as such provide a more complex and exhaustive approach.

The model results, of course, depend on the quality of the data input. As such, it is vital thorough literature reviews are carried out prior to any analysis, and that all potential sources are considered. This is likely to produce very wide range values, for many sources. An a posteriori grouping approach as done here for the Roman period litharge roll may be applied to clarify what are likely to be the main sources, albeit at the cost of some specificity. Some systems, however, will not produce reliable or clear results. If the isotopic range for sources are similar or overlap significantly, the model may be unable to distinguish separate inputs to the mixture. In other cases, the existence of unknown and uncharacterised sources may result in the mixture falling outside the mixing envelope; in such cases, results are meaningless. A rigorous approach to source characterisation and model parametrisation should rule out errors such as these. Finally, we recommend using such models in tandem with more traditional provenance methods (e.g. 3-isotope plotting of all data); such a dual method approach is likely to produce the most reliable source characterisation. The few examples outlined here indicate the value of such an approach, first by clearly identifying which models require the consideration of further sources, or by identifying exact ore regions as major contributors to a mixture.