Introduction

Groundwater is a vital resource for sustainable social and economic development around the world1,2. Stored groundwater is generally purified and filtered during infiltration through natural soils and sediments3,4. Therefore, the quality and quantity of groundwater storage tends to be more stable than surface water. Thus, majority of people in the world primarily depend on groundwater for their drinking water5. It is reported that groundwater provides drinking water for > 1.5b people and supports approximately 40% of agriculture in the form of irrigation6. Therefore, protecting groundwater resources, including efficient use and conservation measures, are an important strategy of water resources plan in both developing and developed countries.

However, natural processes, anthropogenic activities, and climate change significantly influence the quality and quantity of groundwater. In many world watersheds, lakes, rivers, wetlands and the associated ecosystems have experienced impacts and thus, the vitality, availability, quality and quantity of these water resources face serious threats. Elevated concentrations of chemical elements and biological constituents exist in the environment, and depending on geo-environmental backgrounds, water pollutants may exhibit spatial and temporal variations7,8. Anthropogenic processes, such as discharge of untreated sewage water to water bodies, fertilization, and over exploitation, have changed groundwater quantity and quality9,10,11, leading to soil–water and air pollution12. For example, excessive application of chemical fertilizer could result in groundwater contamination10,13,14. Nitrogen leaching, namely the downward transport and percolation of nitrate from the root zone to the soil layers, is one of the aggravating causes for groundwater contamination, especially in irrigated areas15,16. However, it is still unclear how the anthropogenic and natural factors drive the change in the fluxes and storages of water, and associated groundwater quality. Consequently, a linked understanding of surface water and groundwater quality over space and time is critical for the assessment and management of such vital resources and ecosystems services, particularly in arid and semi-arid regions17,18,19,20.

Groundwater assessment is complex, involving identification of surface and groundwater quality and quantity drivers and the occurrence, interaction and determination of the probability of the occurrence of such problems21,22. In order to simulate both saturated and unsaturated water flow from the porous media, many widely used physical models of Richards equations has been developed by many researchers23,24,25. This approach could capture the groundwater dynamics25 but lacks representation of land surface information like vegetation cover. In order to address this limitation, scientists have developed various hydrological models to simulate interactions among water, soil, and vegetation26 modified the SWAT model to calculate lateral and vertical flows through soil layers to improve representations recharge of groundwater mountain region of Germany. Vazquez-Amábile et al.27 applied advanced redox zonation of an alluvial aquifer using data fusion and multivariate geostatistics. They indicated that incorrect wastewater management could cause the organic matter transfer from the eutrophicated surface–groundwater into to groundwater. Luo et al.28 modified SWAT model to simulate the change of soil moisture into groundwater and stream flow in the Muscatatuck River watershed. Baffaut et al.29 incorporated a groundwater module in the SWAT model version 2000 to simulate groundwater evaporation in the Yellow River basin. Watson et al.30 modified the SWAT groundwater module version 2005 to improve infiltration process from sinkholes to surface waters and to evaluate aquifer recharge in karst surroundings of the USA. Mckeown et al.31 and Melaku et al.32 incorporated an algorithm to consider the effect of slope and aspect on incoming solar radiation into the SWAT model in the forested watershed in the Boreal plain. Meshesha et al.33 modified the evapotranspiration in the SWAT model to account for two-way groundwater-surface water movement in estimating the groundwater table in Lethbridge and Barons area. Meshesha et al.34 extended the hydrological model for cold climate regions in order to quantify bacterial fluxes and its effect on surface water quality, and Kim et al.35 evaluated water quality consequently effect on aquatic environment. Guzman et al.36, Nguyen et al.37 and Ng et al.38 coupled the SWAT model with the three-dimensional groundwater flow model (MODFLOW) to represent groundwater flow. The main drawback of the coupled model is that the users must establish numerously formatted variables that properly fit39. Although these coupled SWAT–MODFLOW models enable simulation of groundwater recharge, aquifer evapotranspiration and groundwater levels, it is a big challenge to find a way to dynamically simulate water quality when coupling SWAT source code with MODFLOW code because the two models may have different definitions, formats and arrays of water quality variables. Therefore, such a coupled approach of the two different models requires modification of core codes, such as definitions of variables and its arrays in simulating ground water quality40. Although a plethora of models have been developed in the past few decades, most of the modelling experiences have focused mainly on understanding on the spatio-temporal groundwater storage. Appraisal of distribution and mitigation of chemical elements in the groundwater using SWAT model is still lacking.

Economic activity and human settlement in the Athabasca river basin (ARB) are being varied, and the basin is culturally vibrant and diverse as the homes for more than 150,000 residents with 13% of Aboriginal peoples40. However, the development of agriculture, recreation, forestry, conventional situ oil and gas, and mining can negatively affect the health of the river basin. Developing reliable groundwater quality model as a tool could provide a very useful insight on the potential mitigation of nonpoint source pollution into groundwater in river basins, which are especially important in assessing the pervasive high nutrients loadings from fertilization and manure application. The main objectives of this study are to: (i) improve representation of groundwater quality parameters in a groundwater nutrient module to account for nitrate (NO3) and total dissolved solids (TDS) in the SWAT model41; (ii) assess the effect to NO3 and TDS on groundwater quality status in the Athabasca River Basin; and (iii) evaluate SWAT model sensitivity and uncertainty analysis to understand the possible limitations, and recommend forthcoming directions in model formulation efforts.

Methods and materials

The study area

The research area is in Athabasca River Basin (ARB), which is located in the central part of the province of Alberta in the subarctic region (Fig. 1). Athabasca River is the second largest river in Alberta, and the largest undammed river. The mean annual discharges in cubic decameters (dam3 = 1000 cubic meters) at the points along the river are 2,790,000 dam3 at Jasper 13,600,000 dam3 at Athabasca. The river basin in general have substantial economic contribution in Canada by providing reliable freshwater supply to the people as well as for various industries, such as oil sands mining and pulp mills42. The main land cover type of the ARB is the boreal forest, which shares 82% of the total land; agriculture shares 9.5%, which is in the central portion of the watershed (i.e., Pembina, Lesse Slave, McLeod and upper parts). Overall forest, agriculture, traditional oil and gas extraction, oil sand mining, and coal mining are the major industries of the ARB41. Mean annual precipitation of the area ranges from 300 mm in the lower portion of the river basin to > 1000 mm from the headwaters, while the mean temperature ranges 1.8–5.1 °C43.

Figure 1
figure 1

Geographical location of Athabasca River Basin (ARB), Canada. The DEM of the ARB shows the two water quality monitoring stations used for model calibration for this study. The map was generated using GIS & RS (https://www.arcgis.com/index.html).

The major aquifer types include near-surface sands, buried channels and valleys, Paskapoo aquifers, and bedrock aquifers, which characterize the basin with primary and/or secondary porosities42. In this area, groundwater within unconsolidated surficial deposits consist of a mixture of a preglacial and glacial sand and gravel aquifers. In the Athabasca River Basin, groundwater recharge is spatially variable, and groundwater contributes baseflow to rivers and streams44,45. This region has extensive wetlands, including groundwater-fed wetlands (fens), and one of the world’s most ecologically significant wetlands designated by the Ramsar Convention. The ecology within the watershed is diverse as a result of the different natural regions. As the major river system, the ARB is influenced by a variety of climate, terrain and landscape characteristics found within its basin. The seasonality of climatic conditions is a major influence on river flow conditions.

Soil and Water Assessment Tool (SWAT) description and application

The SWAT model is a river basin scale model, which is developed in order to quantify the impact of land management practices in large, complex watersheds at daily time step46,47. It could be used for simulating the hydrological process, nutrient concentration, runoff generation, and sediment yield in the watershed under various landuse as well as soil scenario46,48,49. Additionally, it is used for simulating the effects of climate change, landuse and management practices on the quantity and quality of water50.

The SWAT has been used for simulating the N cycle in soils and aquifer at shallow depth51,52. In the water and soil, the N processes is dynamic. It could be added to the soil in the form of manure, bacteriological fixation, residue, and rainfalls. It may be moved out from the soil through by plant uptake, volatilization, soil erosion, denitrification, and leaching. In the SWAT model, there are various types of N pools, including inorganic forms of nitrogen, and organic forms. NO3 may transport with runoff, percolation or lateral flow and recharge into the aquifer to the shallow depth from the soil profile. In addition, NO3 can move with groundwater flow to river channels or be carried out of the shallow aquifer into the soil zone during water deficiencies. The amount of nitrate carried by water is calculated by multiplying the concentration of nitrate in the mobile water fraction by the volume of water moving in each route. The amount of NO3 discharge depends on NO3 concentration in the soil-water domain. Nitrate uptake by plants has an inverse exponential relationship with depth47.

$$NO_{{3conc,z}} = 7*\exp \left( {\frac{{ - z}}{{1000}}} \right)$$
(1)

where NO3conc,z refers to initial nitrate concentration (mg/kg) at the depth of z (mm). Only the parts of NO3 is mobile and therefore it available for the discharge through the tiles. To calculate the concentration of nitrate in the mobile water fraction, the following equation has been adopted from53,54.

$$Conc_{{NO_{{3,mobile}} }} = \frac{{NO_{{31y}} (1 - \exp \left[ { - W_{{mobile}} /(1 - \theta e)SAT_{{1y}} } \right])}}{{W_{{mobile}} }}$$
(2)

where ConcNO3,mobile refers to NO3 concentration in the movable water at a given layer (kg N/ha), Wmobile represent the amount of water lost by runoff, percolation or side flow in the layer (mm H2O), \(\theta e\) represent fraction of porosity from which anions are excluded, SAT1y is saturated water content in the soil layer (mm H2O). To obtain the transport of nitrate with runoff, percolation, and lateral flow, the following generic equation has been adopted from55.

$$NO_{3} = \beta _{{NO_{3} }} conc_{{NO_{{3,mobile}} }} Q_{x}$$
(3)

where NO3 represents the NO3 removed by the physical transport (i.e., lateral flow, runoff, percolation) (kg ha−1); βNO3 is the concentration of NO3 in the mobile water for the top 10 mm of soil/kg ha−1 considering both surface and subsurface lateral flow in the top layer; and Qx is the water physical transport (i.e. Qsurf, Qlat, Qperc). The NO3, that percolates to the shallow aquifer from the soil profile may remain in the aquifer, or it may move with groundwater flow into the main river channel or into the deep aquifer. Organic transport of N with sediment is obtained as a concentration function proposed by56 and later applied by57 to the separated runoff events. Estimating the daily organic N runoff loss is on the basis of concentration of organic N in the top soil layer, the sediment yield and the enrichment ratio of the organic nitrogen in sediment to organic N in soil layer51 see Fig. 2). In the SWAT model, water quality procedures integrate essential interactions and relationships used in the QUAL2E model58, which includes the major interactive factors such as the nutrient cycles, benthic oxygen demand, and algae production.

Figure 2
figure 2

The conceptual framework demonstrating nitrate occurrence in the groundwater (adopted from Almasri9).

TDS is all the dissolved chemicals in the water (mg/L). It can be obtained by adding all the concentrations from chemical analysis or can be measured as weight of the residue after a volume of water has been evaporated to dryness. Typically, the concentration of TDS increases with the sample depth that groundwater has traveled from the recharge area to the sample sites. High nutrient concentration in the groundwater shows contamination form different sources of pollution59.

Groundwater module in the SWAT model

Existing SWAT model divides underground saturated aquifer into shallow and deep aquifer. The balance of water in the shallow aquifer can be described47,60:

$$\mathop S\nolimits_{{sh}}^{i} = \mathop S\nolimits_{{sh}}^{{i - 1}} + \mathop W\nolimits_{{rg,sh}}^{i} - \mathop Q\nolimits_{{gw,sh}}^{i} - \mathop W\nolimits_{{rp}}^{i} - \mathop W\nolimits_{{pump,sh}}^{i}$$
(4)

where, \(\mathop S\nolimits_{{sh}}^{i}\) and \(\mathop S\nolimits_{{sh}}^{{i - 1}}\) is the volume of water stored in the shallow aquifer on the day i and i − 1, respectively, \(\mathop Q\nolimits_{{gw,sh}}^{i}\) is refers to the groundwater from the shallow aquifer on the day i, \(\mathop W\nolimits_{{rg,sh}}^{i}\) shows the volume of recharge which is entering the shallow aquifer on the day i, \(\mathop W\nolimits_{{rp}}^{i}\) is indicates the amount of water entering to the soil for the evaporation as well as transpiration on the day i, \(\mathop W\nolimits_{{pump,sh}}^{i}\) is refers to the amount of water removed from the shallow aquifer in the form of pumping on the day i. Assuming that the variation in groundwater is linearly correlated to the water table, the flow of groundwater can be represented by the following equations:

$$\mathop Q\nolimits_{{gw,sh}}^{i} = \alpha _{{gw,sh}} \mathop s\nolimits_{{sh}}^{i}$$
(5)

where, \(\alpha _{{gw,sh}}\) refers to the groundwater recession constant of the shallow aquifer. Groundwater from the shallow aquifer contributed to the river on day i is obtained as:

$$\mathop Q\nolimits_{gw}^{i} = \mathop Q\nolimits_{{gw}}^{{i - 1}} \cdot \;{{e^{{- \alpha _{{gw}} \cdot \Delta t }} }} + W_{{rg,sh}} \cdot (1 - e^{{ - \alpha _{{gw}} \cdot \Delta t}} ) \;\;\;\;if\; {S_{sh} > S_{shthr,q}}$$
(6a)
$$\mathop Q\nolimits_{{gw}}^{i}=0 \;\;\;if\; {S_{sh} < S_{shthr,q}}$$
(6b)

where \(S_{shthr,q}\) is a threshold value above which the stored groundwater flows to river channels, \(\alpha _{{gw}}\) is the base flow recession constant, \(W_{{rg,sh}}\) is the amount of recharge entering the shallow aquifer on day i, and \(\Delta t\) is the time step.

The volume of recharge entering the shallow aquifer on day i is obtained using the following equations:

$$\mathop W\nolimits_{{rg,dp}}^{i} = (\beta _{{dp}} )\mathop W\nolimits_{{rg}}^{i}$$
(7)
$$\mathop W\nolimits_{{rg,sh}}^{i} = (1-\beta _{{dp}} )\mathop W\nolimits_{{rg}}^{i}$$
(8)
$$\mathop W\nolimits_{{rg}}^{i} = (1- e{^{-{\frac {1}{\delta_{gw}}}}}) \mathop W\nolimits_{{seep}}^{i} + (1- e{^{-{\frac {1}{\delta_{gw}}}}}) \mathop W\nolimits_{{rg}}^{i-1} $$
(9)

where \(\mathop W\nolimits_{{seep}}^{i}\) is the total amount of water exiting the bottom of the soil profile on day i, and \(\delta_{gw}\) is the drainage time of the overlying geologic formations. \(\mathop W\nolimits_{{rg}}^{i}\) and \(\mathop W\nolimits_{{rg}}^{i-1}\) are the amount of recharge entering the aquifers on day i and i-1, respectively. \(\delta_{gw}\) is estimated against observed data in water table level through simulating aquifer recharge.

Likewise, from the deep aquifer groundwater, which is contributed to the stream on the day, i is obtained by:

$$\mathop Q\nolimits_{{gw,dp}}^{i} = \mathop Q\nolimits_{{gw,dp}}^{{i - 1}} e^{{ - \alpha _{{gw,dp}} }} + \mathop W\nolimits_{{rg,dp}}^{i} (1 - e^{{ - \alpha _{{gw,dp}} }} )$$
(10)

where \(\mathop Q\nolimits_{{gw,dp}}^{i}\) and \(\mathop Q\nolimits_{{gw,dp}}^{i-1}\) refer to flow of groundwater from the deep aquifer into the stream on day i and i-1, respectively, and \(\alpha_{gw,dp}\) is the groundwater recession constant of the deep aquifer.

Due to complexity of groundwater, we separate the shallow aquifer into a lower aquifer and an upper aquifer in the groundwater module of SWAT model to improve the model accuracy, similar to Shao et al.58. Thus, the groundwater flow in Eq. (6) can be replaced using upper and lower aquifers as follows

$$\mathop Q\nolimits_{{gw,u}}^{i} = \mathop Q\nolimits_{{gw,u }}^{{i - 1}} {{e^{ - u} }}+ \mathop W\nolimits_{{rg,u}}^{i} (1 - e^{{ - \alpha _{{gw,u}} }} )$$
(11)
$$\mathop Q\nolimits_{{gw,l}}^{i} = \mathop Q\nolimits_{{gw,l }}^{{i - 1}} {{e^{ - \alpha{_{gw,l}}} }}+ \mathop W\nolimits_{{rg,l}}^{i} (1 - e^{{ - \alpha _{{gw,l}} }} )$$
(12)

where, \(\mathop Q\nolimits_{{gw,l}}^{i}\) and \(\mathop Q\nolimits_{{gw,u}}^{i}\) refers to flow of groundwater from lower and upper aquifer into the stream on the day i respectively; where as \(- \alpha _{{gw,u}}\) and \(- \alpha _{{gw,l}}\) shows groundwater recession constant of the lower and upper aquifer respectively; \(\mathop W\nolimits_{{rg,l}}^{i}\) and \(\mathop W\nolimits_{{rg,u}}^{i}\) shows the volume of recharge entering into the lower and upper aquifer in the day i respectively.

SWAT model setup

Delineation processes were done in ArcGIS. Based on soil types and landuse classes, the hydrological response unit (HRU) was defined, based on similar soil, similar landuse, and slope types. About 1370 HRUs were identified for the basins. Weather data, such as, temperature, rainfall, wind speed, humidity, and radiation observation were obtained from Alberta Environment and Parks. Groundwater well and water quality data (nitrogen species and TDS) at 1,300 different monitoring stations was obtained from Alberta Environment and Parks as shown in Fig. 1. Data availability is one of the most prominent factors which affect the model accuracy. Therefore, there were two monitoring stations with > 15 samples for the selected parameters between 2004 and 2016 namely IOR-KRL-03 (ss) and IOR-KRL-04 (ss) monitoring stations. Both groundwater monitoring wells are completed in surficial sand aquifers at 3 m and 11 m respectively.

Model performance metrics and uncertainty analysis

Measured groundwater quality data for NO3 and TDS were used for model calibration and validation. To evaluate model performance, coefficient of determination (R2), Nash–Sutcliffe efficiency (NSE), and percent bias (PBIAS) were used61. The descriptions of NSE, R2, and PBIAS can be found in62 as follows:

$$R^{2} = \left( {\frac{{\sum\limits_{{i = 1}}^{n} {(O_{i} - O_{{avr}} )(P_{i} - P_{{avr}} )} }}{{\sqrt {\sum\limits_{{i = 1}}^{n} {(O_{i} - O_{{avr}} )^{2} } } \sqrt {\sum\limits_{{i = 1}}^{n} {(P_{i} - P_{{avr}} )^{2} } } }}} \right)^{2}$$
(13)
$$NSE = 1 - \left[ {\frac{{\sum\limits_{{i = 1}}^{n} {(O_{i} - P_{i} )^{2} } }}{{\left[ {\sum\limits_{{i = 1}}^{n} {(O_{i} - O_{{avr}} )^{2} } } \right]}}} \right]$$
(14)
$$PBIAS = \left[ {\frac{{\sum\limits_{{i = 1}}^{n} {Obs_{i} - Sim_{i} )*100} }}{{\sum\limits_{{i = 1}}^{n} {Obs_{i} } }}} \right]$$
(15)

where Oi refers to the ith observed value; Oavr is the average observed value; Pi is the ith simulated value; Pavr is average simulated value; n is the number of time step (days in our case).

To examine the model performance, NO3 and TDS of groundwater quality have been compared with the observed data during calibration (2009-2012) and validation (2013-2015). In order to perform parameter sensitivity and uncertainty analysis, we used Sequential Uncertainty Fitting2 (SUFI-2) algorithm in the SWAT-CUP63. The program generated various parameter sets for these selected parameters from a specified range of values using the Latin-Hypercube sampling technique (Table 1). To identify the total predictive uncertainty band of the simulated results, the SWATCUP was run several times, each one identifying a narrow parameter range for each parameter listed in Table 1, up to the point where reasonable goodness-of-fit statistics values were attained.

Table 1 The ranges of parameters included prior and after model calibration.

Results

Sensitivity and uncertainty analysis

Sensitivity test analysis is to identify the most sensitive parameters that govern NO3 and TDS in groundwater (Table 1). The range of selected sensitive parameters are presented in Table 1. Some parameters are basin scale and others are basin-based parameters. The analysis of model sensitivity can be processed to find out the relative response of the SWAT model to the changes in relative value of specific model parameters. Hence, some parameters are sensitive to control the whole system processes as the most significant parameters. Here, the most sensitive parameters governing the groundwater parameters were assessed on the basis of the values obtained during primary model calibration. The CDN.bs, r__GWNO3.gw, HLIFE_NGW.gw, and NPERCO.bsn were ranked as the most sensitive parameters among the NO3 parameters. While NSETLR1.1wq and N_UPDIS.bsn were the sensitive parameters from the N parameters, that influenced the concentration. On the other hand, the parameters listed in Table 1 were the most sensitive for TDS, groundwater conditions, and surface runoff. The results of the sensitive analysis confirmed that measured input parameters have a substantial influence on the model prediction.

Model Calibration and Validation performance

Model calibration and validation are done after the sensitivity analysis. To compare the model performance, the simulated outputs were compared with observed value. Therefore, the daily NO3 and TDS from the groundwater monitoring stations IOR-KRL-03 (ss) and IOR-KRL-04 (ss) from ARB were employed for the model calibration and validation to evaluate the model performance using SWATCUP which was recommended by Arnold et al.41. Based on the criteria of model performances rating, the value of PBIAS, NSE and R2 during model calibration and validation were employed for evaluating the model performance. To calibrate the modified SWAT module, Latin-Hypercube One-factor At a Time (LH-OAT) was employed.

Table 2 summarizes the performance statistics of the model for the daily nutrient concentration simulations for two groundwater monitoring stations. Table 2 shows satisfactory to very good for both stations with an averaged R2 of 0.74 for nitrate during calibration and 0.71 during validation. This confirmed that the model was able to capture the concentration of nutrients after model modification (Table 2 and Fig. 3). Therefore, the overall model performance of the new SWAT module for the daily nutrient concentration simulations was in acceptable range of model calibration and validation in the ARB. In contrast, a lower model performance for TDS simulation was observed at both monitoring stations, for which the value of NSE is found to be 0.58 during validation at IOR-KRL-03 (ss) and 0.48 during calibration at IOR-KRL-04 (ss).

Table 2 Model performance assessment for the daily observed data.
Figure 3
figure 3

Comparison of daily observed and simulated groundwater quality parameter (NO3/mg/L) at IOR-KRL-03 (SS) (A) and IOR-KRL-04 (SS) (B) monitoring stations.

Nitrate (NO3 )

Nitrate calibration processes happened in a nitrogen percolation coefficient of 0.5, with a denitrification threshold of water content and exponential rate of denitrification coefficient of 2.5 (Table 1). Nitrate percolation coefficient (0.223) and half-life parameter of nitrate in the shallow aquifer (116) were calibrated for each sub-basin for the values in the range of 0–1 and 0–200 day−1 respectively, resulting in 0.223 and with the mean value of 116 day−1 respectively for the whole selected ARB. The annual mean groundwater nitrate concentration is 0.31 mg/L at station IOR-KRL-03(ss) and 0.03 mg/L at station IOR-KRL-04(ss) (Fig. 3). Mineralization was adjusted by minimizing the default values of the rate factor for CMN to 0.000131. To control the depth distribution of nitrogen uptake, and to slow down simulated kinetics, the N-UPDIS was increased from the default value of 20 to 28 and therefore a large mass of NO3 was removed from the upper layers as per the report by64,65. The range of nitrogen settling in reservoirs was kept constant during the year as per66, the range was set greater than the default values from the Sava River Basin, to efficiently simulate the substantial retention in major wetlands because wetland specific retention is not applied in the extended SWAT model.

The nitrification rate decreased from the upper to lower Athabasca River Basin, which mirrors the rainfall distribution, with lower rainfall leading to lower soil saturation and lower nitrification rates. The maximum values of nitrate in groundwater of the ARB were 0.62 mg/L at IOR-KRL-03(ss) and 0.1 mg/L at IOR-KRL-04(ss) stations respectively, whereas the minimum values were 0.02 mg/L at the IOR-KRL-03(ss) station and 0.01 mg/L at the IOR-KRL-04(ss) station. The nitrification calibration resulted to satisfactory predictions of NO3 daily concentration in the two observation stations in the calibration and validation dataset. Ae per the comparison of NO3 between simulated and observed, better model performances were obtained in the model evaluation dataset. The simulated daily NO3 concentration agreed with the observed data and were acceptable range (Fig. 3). The percentage BIAS obtained from the observed and calculated daily loads also was ranked as acceptable to very good for all the selected stations (Fig. 3 and Table 2).

Total dissolved solids (TDS)

The TDS module for groundwater quality used the TDS concentrations from the selected groundwater monitoring stations in the ARB. The calibrated TDS parameters with specific ranges were presented in Table 1. Daily TDS concentrations were calibrated by adjusting the parameters related to groundwater. The model performance evaluation criteria reported by62 were used for the daily nutrient simulation as a guideline in evaluating the model performance for the daily TDS concentrations. Figure 4 shows that for the whole period of simulation, R2, NSE and PBIAS values were found to be good to very good during calibration while they were satisfactory to good during the validation (Table 2). Generally, the overall model performance for the daily TDS concentration simulations in groundwater of the ARB shows that the model could capture the observed concentrations. When comparing the calculated and observed concentrations (Fig. 4), the simulated TDS concentrations were also acceptable as both show similar trends. Yet, local inconsistencies were noticed in the river basin. The highest percentage of overestimation and underestimation of TDS in the calibration dataset occurs for high concentration observations and probably reflects the SWAT model’s representation of high level concentrations67, which may cause errors in the process of calculating TDS in the SWAT model68. The other possible sources of overestimation and underestimation of the model simulations originate from uncertainties in the input data and observed data. For example, the frequency of data collection varied from once or twice per month to once every few years. In contrast, the simulations used a daily time step. The low frequency of sampling might miss the peak or valley of nutrient concentrations.

Figure 4
figure 4

Comparison of daily observed and simulated groundwater quality parameter (TDS/mg/L) at IOR-KRL-03 (SS) (A) and IOR-KRL-04 (SS) (B) monitoring stations.

Total Dissolved Solids is an indicator of the availability of total dissolved salt and other constituents that affect groundwater quality. High TDS concentrations in groundwater can occur naturally, and may also indicate a downward movement of leachate into groundwater65,69,70,71. The average TDS concentrations in groundwater in the study area are 143 mg/L and 1421 mg/L at IOR-KRL-03(ss) and IOR-KRL-03(ss) stations, respectively. The TDS of groundwater samples ranges from 120-182 mg/L at the IOR-KRL-03(ss) station, while the TDS ranges from 1380-1500 mg/L at the IOR-KRL-04(ss) station (Fig. 4). High concentrations of TDS in groundwater reduce the palpability of water for drinking and may cause gastrointestinal pain and emetic effects in humans67. TDS is an important indicator for evaluating the quality of groundwater. High levels of TDS typically occur for hard water and might require groundwater treatment to decrease concentrations below 500 mg/L67.

The performance of model results were investigated by using the analysis of scatter plot between observed and simulated estimates at each groundwater monitoring stations (Fig. 5). The slope at the selected stations were considerably far from zero. This showed that the model prediction accuracy was enhanced by extended groundwater model. Thus, the results of the 1:1 fitting line confirmed that the extended groundwater SWAT module was effective prediction of nutrient concentration in the groundwater. The scatters are closer to the 1:1 line for the entire study time at two monitoring stations, albeit at some points relatively far from the fitting line probably due to limited availability of observed data. In general, it is worth to conclude the extended SWAT groundwater module shows better efficiency for simulating nutrient concentrations in the groundwater.

Figure 5
figure 5

Scatter plot comparison between of daily simulated and observed groundwater quality parameters at IOR-KRL-03 (SS) (A) and IOR-KRL-04 (SS) (B) monitoring stations.

Groundwater quality analysis and nutrient concentrations

Analysis of observed against simulated nitrate and TDS concentrations in ARB groundwater helps to identify sources of errors across two groundwater monitoring stations within the river basin. Observed nitrate concentrations vary significantly across the area (Fig. 3). Concentrations of NO3 in groundwater have been highly affected by emissions of both point and non-point sources in different watersheds across the world71. To assess pollution sources and quantify the loads of NO3 entering the whole river basin, NO3 load to groundwater has been simulated at various hydrogeological response units. The simulation periods were chosen based on the availability of point-source information. The long-term daily average concentrations of nitrate in groundwater attained maximum values of 0.59 mg/L at the IOR-KRL-03(ss) station during 2013. The long-term daily average concentrations of nitrate in groundwater reached a minimum of 0.025 mg/L in the year 2014. On the other hand, at station at IORKRL-04(ss) the long-term daily average NO3 concentrations recorded a maximum value of 0.1 mg/L during the year 2015, and a minimum value of 0.013 mg/L in most observation years. Generally, the concentrations of nitrate were well captured in all the groundwater monitoring stations, although some overestimations were observed at some points at IOR-KRL-04(ss) monitoring station (Fig. 3) and some underestimations were found at IOR-KRL-03(ss) and IOR-KRL-04(ss) monitoring stations. The probable reason for model overestimation is the data uncertainty such rainfall and snow distribution, in which the rainfall and snow distribution caused in nutrient concentrations. The Underestimation was probably subjected to the uncertainty of the input data and measured data. Furthermore, the observed NO3 data is not sufficient for continuous daily NO3 concentration for the entirely considered period for this study. Therefore, observed data could be sources for the errors. The simulation of NO3 removal by soil denitrification could be improved by varying the river basin parameters. However, the existing SWAT model could not represent perfectly the seasonal difference in nitrate concentrations. After carefully extending the model, the findings of this study highlight the need to improve spatial representation of nitrate concentrations in groundwater and in the parameters that influence nitrate concentrations at the watershed scale.

Concluding remarks

Groundwater is a precious natural resource supporting the existence of life on earth. However, various factors, such as soil properties, crop growth, industrial wastes, and agricultural management, can influence its quality. Development of industries and agriculture increase water use, which puts pressure on groundwater quality and may, in turn, influence the ecosystem of the Athabasca River Basin. Particularly, the use of fertilizers, manure, and industrial wastes may contribute to the pollution of groundwater and connected surface water environments. It is necessary to perform optimal management of groundwater resources in the basin.

State-of-the-art groundwater quality modelling at the ARB is recognized as a vital component of groundwater management, where further in-depth studies may be required to offer valuable insights related to groundwater condition and nutrient processes at the various spatio-temporal scales (i.e. site and region, and daily, monthly and annually). This can better support nutrient monitoring networks for river basin management, and enhance understanding of changing nutrient concentrations in the ARB. The SWAT model results could support the development of indicators for groundwater quality parameters, and also support integrated surface water and groundwater management. However, the accuracy of SWAT predictions is limited by data availability and structure of the model. This may result in errors while modeling processes in the river basin. For example, the highest percentage of overestimation and underestimation for TDS in the calibration dataset probably reflected the poor representation of SWAT model to high concentrations, as well as uncertainties of the input data and observed data, which in turn may cause errors in estimating groundwater TDS in the SWAT model. With continued monitoring of the groundwater network at regular frequencies, high quality input data would be available for better calibration and validation of the model. Furthermore, additional data would improve the representation of processes and pathways controlling groundwater pollution and therefore allow evaluation of the effect of land and water management practices to support the implementation of the best management practices.

In this study, we extended the existing SWAT model to improve modelling of groundwater quality (i.e. NO3 and TDS). A systematic calibration and validation of the SWAT model has been performed to compare the observed groundwater NO3 and TDS concentrations in the ARB. The results revealed that the new groundwater quality model in the SWAT is able to capture the daily nutrient concentrations in groundwater. The simulated results agree with the observed data with satisfactory and good performance for both groundwater monitoring stations as per the model performance measures. Thus, the process-based hydrologic groundwater quality model is an effective tool in simulating the groundwater quality dynamics (NO3 and TDS) for sustainable groundwater and surface water management in the river basin.