Introduction

Tritium (3H; half-life 4500 ± 8 days) is a hydrogen radioisotope widely used to study water movement in terrestrial critical zones, the oceans, and to estimate the residence times of water in near-surface aquifers and surface water bodies1,2,3,4,5,6. Tritium in environmental water is often expressed in TU (Tritium Units), which equals to one molecule of 3H1HO per 1018 molecules of 1H2O, or 0.11919 Bq kg−17, and is produced naturally in ultra-trace quantities in Earth’s upper atmosphere via by the interaction of cosmic rays with N nuclei, at significantly higher rates at high geomagnetic latitudes8. Tritium quickly oxidizes to form 1H3HO (water) and thereby enters the global water cycle as water vapour and precipitation.

From 1945 to 1963 ca. 450 surface atmospheric nuclear bomb tests were conducted, thereby injecting large amounts of anthropogenic 3H into the stratosphere, increasing by orders of magnitude the natural 3H levels in precipitation9. Tritium activities on the order of 5000 to 10,000 TU were measured in rainfall in the early 1960s in the Northern hemisphere, challenging human health exposure guidelines for this radioisotope (900–90,000 TU) 10. Following the 1963 Partial Nuclear Test Ban Treaty on atmospheric thermonuclear tests, 3H concentrations in rain rapidly dropped, followed by a slow return to pre-bomb levels (1–25 TU) by the mid 2000s11. Currently, anthropogenic tritium in Earth’s environmental waters is mainly detectable around local sources like nuclear power or fuel reprocessing plants, medical, and other waste-related activities10. Historically, there are no direct measurements of pre-nuclear age 3H in precipitation before the 1950s; however, estimates for the temperate and cold climates have been made in the 1960’s using pre-1945 vintage wines12 and dated ice cores.

Since the 1960s, the International Atomic Energy Agency (IAEA), in cooperation with the World Meteorological Organization, coordinated collection of 3H data on the global atmospheric fallout in precipitation through its Global Network of Isotopes in Precipitation (GNIP; https://nucleus.iaea.org/wiser). While the decadal decline in 3H in rainfall to pre-nuclear levels (< 1–5 TU for most tropical and subtropical regions) constitutes an analytical detection limit challenge, tritium remains an invaluable short-lived radioisotope for gaining key information on modern freshwater dynamics, critical zone hydrology, and recharge rates to phreatic aquifers. Rising interest in using tritium for protecting Earth’s critical zone water resources necessitates accurate estimates of water replenishment rates powerfully achieved using 3H or 3H/3He groundwater age dating methods13 or for the mapping of groundwater vulnerability to pollution14. Local and regional hydrological flow models using 3H require accurate data on the contemporary tritium content in precipitation. Unfortunately, 3H monitoring sites are few and far between, resulting in scientists relying on data from a distal GNIP station or on spatiotemporal extrapolation, which results in best-guess or inaccurate estimates of 3H inputs for groundwater modelling, recharge rates, or water flux determinations15. Predictive maps alleviate this problem through data-driven models that guide the interpolation of the continuous space between observed 3H data points16.

3H in global precipitation

We examined contemporary global distribution patterns of 3H in Earth’s precipitation within the “post-bomb” hydrologic timeframe11, in the absence of new anthropogenic 3H alterations having global impacts other than natural solar flux cycles17,18,19 and constraining our analysis to the years 2008–18 for data availability reasons. We focused our efforts on monthly precipitation composites and 3H data from the IAEA GNIP database20 and published data (see discussion in Supplementary Material SM1), with the aim to produce high spatial resolution predictive maps of annual 3H in Earth’s rainfall. The curated 3H dataset comprised 233 stations for the post-bomb timeframe, with the criteria that they were not affected by anthropogenic sources; most had a minimum continuous monthly record of at least 3 years between 2008 and 2018 (Fig. 1). From this curated dataset, we excluded GNIP sites in Antarctica (n = 4) due to scarcity of observations and the special effects of stratospheric-tropospheric exchange processes controlling 3H levels in snow on this continent21. We modified IAEA’s regionalized cluster-based water isotope prediction model for stable isotopes in precipitation (RCWIP)22,23 by incorporating 1H3HO as the predictor variable and considering a wide range of explanatory spatial regressors (longitude [LON], weighted latitude [wLT], altitude [ALT], land mass fraction [LMF], and distance to coast [DTC]), as well as climatic and environmental covariates (air temperature [AT], precipitation [PP], radiation-related and other predictors). We selected the best-fit regional predictor model and performed model validation by cross checking the predicted versus observed data (“Methods”).

Figure 1
figure 1

Global patterns of 3H in precipitation. Model gridded and measured contemporary (2008–2018) 3H contents in Earth’s precipitation at GNIP stations (circles). The tritium distribution map (“isoscape”) was developed using geostatistical techniques described in the Regionalized Cluster-Based Water Isotope Prediction model (RCWIP2). See text and SM1 for details. Figure created in R 4.1.0 (with ggplot2 3.3.5, raster 3.4–13 and rgdal 1.5–23 libraries, all https://cran.r-project.org/).

Prediction models for 3H in global precipitation

Three best-fit multiple-regression models were found to accurately predict 3H in worldwide annual precipitation. All models had a high degree of confidence and accuracy, whether it was a (i) global, (ii) extratropical, or a (iii) tropical model (Supplementary Material SM1). The most parsimonious multiple regression predictor models were:

$${\text{Global}}\,^{{3}} {\text{H }} \,\left( {{\text{TU}}} \right) \, = \, 0.0313{\text{ wLT }} + \, 2.56 \, 10^{{ - {6}}} \,{\text{DTC }} \,{-} \, 0.2077{\text{ AT }} + \,6.2068{\text{ LMF }} + \, 0.0056{\text{ LONG }} - \, 0.0279{\text{ NLR }} + \, { 61}.{3961}$$
(1)

which is a first global model resulting in an R2 = 0.79 (p < 0.01), where air temperature (AT), wLT, LMF, and DTC explained 26%, 24%, 17% and 9%, respectively, of the overall variance (net longwave radiation [NLR] had only a minor share). As expected, these regressors are the well-known temperature and “continentality effects” on 3H distributions9,24. The significant response of wLT affirmed the need to specifically account for northern and southern hemispheric differences in the distribution of their land masses. However, the global model (Eq. 1) did not perform as well in tropical and lower latitude regions. Accordingly, it was beneficial to develop the secondary extra-tropical and tropical regional models (see Discussion in23):

$${\text{Extratropical}}\,^{{3}} {\text{H }} \, = \, - 0.2505{\text{ AT }} + \, 8.0153{\text{ LMF }} + \, { 3}.0{4 \,1}0^{{ - {6}}}\, {\text{ DTC }} + \, 0.0053{\text{ LONG }} + \, 0.0176{\text{ wLT}} \,{-} \,0.0011{\text{ ALT }} + \,71.7114$$
(2)

yielding an R2 = 0.81 (p < 0.01), and with AT, LMF, wLT, and DTC explaining 24%, 23%, 19%, and 12% each of the variability (ALT and LONG were of lesser relevance). For the tropical and low-latitude model, a different set of predictor variables were required, owing to lower cosmogenic 3H production rates and substantial dilution by low-3H tropical oceanic moisture sources:

$${\text{Tropical}}\,^{{3}} {\text{H }} \,= \, 0.0513{\text{ wLT }} + \, 2.2882{\text{ LMF }} + \, 0.0061{\text{ AT }}{-} \, 0.0399{\text{ OLR }} \,{-} \, 0.059{\text{ CPN }} + \, 0.0003{\text{ PP }} + \, { 9}.{3437}$$
(3)

which yielded an R2 = 0.62, p < 0.01) and whose variability was explained 27% by wLT, 15% by LMF, and the remaining variance equally distributed over four climatic parameters (AT, outgoing longwave radiation [OLR], convective precipitation amount [CPN] and PP). Wherever sufficient 3H data was available, RCWIP clusters were calculated with the appropriate regionalized regression equations. For clusters with 3H data deficiency, the tropical and extra-tropical models were used as replacements for the global model. The resulting prediction models were applied and merged for each 0.1 × 0.1 degree grid cell using fuzzy clustering climatic zone membership fractions22 to create a single unified prediction map of contemporary 3H in annual global precipitation (Fig. 1).

Results and prediction model validation

Our predictive models revealed that contemporary 3H in precipitation ranged from near to below instrumental detection limits of ca. 0.5 TU to as high as 25 TU in different regions of the world and exhibiting highly distinctive geospatial patterns. The known latitudinal gradient in 3H for precipitation extending from the tropics to the poles was accurately predicted, as well as substantially higher 3H activities around the poles. The northern hemisphere showed significantly higher predicted 3H activities than the lower latitudes, equatorial areas, and the southern hemispheric land masses.

The predicted distinctive spatial global patterns depart from a commonly held belief that 3H in precipitation in the post-bomb timeframe would be uniformly low and near or below analytical detection limits worldwide. To affirm our predicted spatial patterns, we performed validation of the prediction model against 3H records from selected stations as depicted in Fig. 2. For example, the predicted 3H content for 233 stations plotted versus their measured 3H data showed our predictions were accurate and in a narrow band within ± 20% of station-based observational data, and thereby validating our model prediction results. The outliers in Fig. 2 are from continental mid-latitude areas and mainly stem from minor local anthropogenic emissions that could not be captured in the initial data screening. The mean absolute bias (MAB) of the model point-based prediction was 0.8 TU, and the absolute percentage bias was 17%. However, we did not find any systematic relationship of bias to latitude or the observed 3H values.

Figure 2
figure 2

3H predictive model validation. Left: predicted vs observed contemporary 3H contents in global precipitation expressed in tritium units (TU). Right: model 3H residuals expressed as percent deviation between predicted and observed mean tritium values vs latitude. Open circles indicate data points with biases > 20% or beyond the MAB. Figure created in R 4.1.0 (with ggplot2 3.3.5 library, all https://cran.r-project.org/).

Due to the relatively small dataset for modelling and the associated technical challenges to spatially split into calibration and validation subsets, we used several techniques to cross-validate our results. First, we tested our post-bomb 3H predictions against pre-bomb proxy 3H archives (wines and an ice core) and post-2018 precipitation 3H data with monthly records spanning < 3 years, which precluded them from the predictive model due to incorporation criteria; we found predictions matched both the archive and post-2018 data within reasonable uncertainty margins (see discussion in Supplementary Material SM3). Second, we assessed the plausibility of the predicted 3H content and isotopic gradients of the continentality effects by comparing our predictions along continentality gradients to measured “continentality endmember” stations (e.g., Yakutsk, Russian Federation), which show the highest 3H levels in all available time series from the bomb peak to present, and by considering the ratios of oceanic vs. continental 3H levels as a constant during the bomb peak and tail periods. This model confirmation analysis is presented in detail in Supplementary Material SM4.

Unique global spatial 3H patterns

Contemporary Eurasian precipitation follows the well-known 3H enrichment gradient12,24 concomitant with continentality and the decreasing degree of dilution with maritime water vapour along a west–east transect, revealing the importance of the westerlies which explain the predicted lower east–west gradient from the Pacific to Central Asia24. The 3H ratios of continental GNIP stations compared to the oceanic end member of long-term GNIP station Valentia (Ireland) on the Atlantic coast for historic, present, and predicted data showed the continental 3H end member for Central Eurasia is ca. 22 TU. Our model predictions exceeded 22 TU in some mountainous areas; yet these upper predictions need to be verified with observational data. Our model predicted elevated 3H, exceeding 15 TU, over the Himalayas and the Tibetan Plateau in agreement with excesses in cosmogenic radionuclides such as 35S25 but predicted values also need to be validated with 3H data for extreme altitudes. Similarly, the North American continent model exhibited patterns of higher 3H levels following a continentality gradient that closely mirrors the spatial distribution of stable water isotopes (δ2H, δ18O), clearly demarcating the polar vs. Caribbean influences26, and a south-north gradient with markedly lower 3H values along the fringes of the Atlantic and Pacific coastlines. These distinctive 3H patterns reflect the diluting influence of the presumably low-3H Gulf Stream and North Pacific currents, respectively. In the absence of sufficient observational data and stations in general, the location and magnitude of the North American continentality end member remains to be validated. Our model predictions, however, fully agreed with observed 3H data from the Sonoran and Chihuahua deserts27. Along similar latitudes to North America, the Mediterranean region was distinctive by an absence of open-ocean moisture 3H dilution, yet still affected by the rising limb of cosmogenic tritium production spanning from annual mean 3H levels around 2.5 TU in Gibraltar up to 6 TU in the Middle East region, along both latitudinal and longitudinal gradients. Our predictions affirmed that higher than expected 3H values in subtropical areas are related to deep convective precipitation processes, such as in the South America Chaco-Pampa region (peaking at 8 TU) and in parts of the Sahara. In these regions the coexistence of sufficiently deep high-altitude convection and elevated cosmogenic tritium at higher atmospheric levels is the most likely explanation for these patterns 28. In contrast, much of South America, Africa, the Indian subcontinent, Australia, and the Asia–Pacific regions were predicted (and verified) to have low 3H levels between ca. 2–4 TU. For some regions, like Central America, the Amazon basin, and the large archipelagos in Southeast Asia and the western Pacific, our model predicted 3H contents < 1 TU in precipitation which is at or below the analytical detection limits of most tritium laboratories29, thereby posing significant challenges for practical applications of 3H as a natural radiotracer for hydrology in these areas. Marine coastal fringes for all continents exhibited lower 3H in precipitation compared to inland and is clearly influenced by low-3H oceanic moisture impacts; yet the continental gradients are not impacted as much as higher latitudes.

In summary, our new predictive model for 3H in modern global precipitation revealed highly distinctive spatial patterns that for many parts of the world, and contrary to widely held expectations, showed significantly higher tritium activities than were anticipated. These findings support the promotion and further development of tritium as a powerful short-lived natural radiotracer in hydrological and environmental science applications, particularly for mapping recently recharged groundwater, critical zone hydrology, and groundwater vulnerability. With improved 3H detection technologies (~ 0.1 TU precision) by 3He-ingrowth or higher degrees of sample pre-enrichment, tritium coupled to other tracers will support the improvement of spatiotemporal hydroclimatic models. Moreover, our predictive maps provide reliable 3H input functions for hydrologic transport models and provide an accurate predictive baseline upon which to detect and assess potential impacts anthropogenic nuclear activities. Model prediction accuracy and precision will be improved in time with further attention to systematic 3H in precipitation sampling in data deficient regions.

Methods

3H data in precipitation

The geographical distribution of 3H sampling sites is shown in Fig. 1. Monthly 3H precipitation composite data was obtained from the GNIP database and related publications (IAEA/WMO, 2020) covering the period 2008–2018 (see additional details in Supplementary Materials SM1). This timeframe was selected to represent present-day tritium levels (i.e. post-bomb)11 covering an 11-year period falling mostly into solar cycle 24 for at least a part of the station data used18. The dataset also included 3H data from 5400 monthly precipitation samples at 91 sites not yet published in the GNIP database. Additional data was taken from the literature. The tritium data for our spatial analysis and modeling contained records from 233 sites from 2008 to 2018, but some stations have variable lengths of records (Supplementary material SM1). During our data curation we considered each station regarding its time-dependent position in the solar cycle. Recent studies suggest that the solar cycles 10–12 years impart ca. ± 20% of fluctuation in Central Europe that gets overprinted on annual 3H seasonality17,18. In our analysis we found that 3H records with at least three years of continuous monthly sampling, whether during the rising or falling limb of the solar cycle, corresponded within the 3H measurement uncertainty to the stations’ mean value for the period 2008–2018. For our mapping purposes we applied a lower 3H detection limit cutoff as < 0.5 TU which is in accordance with the lower limit of international laboratory performance29 but also corresponds to the lowest values of recent GNIP samples analyzed (from Galapagos and Ascension Islands).

Regressor datasets

Our three prediction models used the regressors used in the Regionalized Cluster-Based Water Isotope Prediction 2 (RCWIP221,22) model for δ18O and δ2H along with geographic and climatic regressors. The geographic regressors were (weighted) latitude, longitude, elevation, distance to coast and land mass fraction. Climatic regressors included air temperature, vapour pressure, precipitation amount, precipitable water, convective precipitation, wind speed and the continentality index30. Where in-situ measurements for the explanatory variables were unavailable they were obtained from the Global Historical Climate Network (GHCN)31 or NOAA-NCEP reanalysis data ((https://www.ncep.noaa.gov/). As gridded data for these regressors we used CRU-CL 2.132 and up-sampled NCEP datasets.

Climatic zone clustering and statistical processing

The 3H mapping used the code of RCWIP2, including the fuzzy clustering schema in 36 extra-Antarctic clusters of spatial proximity and climatic similarity22,23. As only a fraction of the full clustering scheme (16 of 36 clusters) could be populated with sufficient 3H data to derive a full regional model, we allowed for fallback on the tropical or extratropical regression models (Eqs. 2, 3). A spatial coverage map of regionalized tropical and extratropical models is given in Supplementary Material SM2. The RCWIP2 algorithms automatically detected the best-fitting regression equation and covariates23. However exceptionally, the use of the tropical and extratropical models had to be enforced for a few clusters to avoid runaway outlier 3H predictions, which we attributed to the spatially and climatically skewed distribution of 3H input data (Supplementary Table S2).