Introduction

Groundwater is an integral freshwater source that serves different human needs in many parts of the world. For agricultural purposes, groundwater contributes to approximately 43% of worldwide water usage for irrigation1. Groundwater is also a major water source for household activities in many areas including the countries in Africa2,3 and Asia4,5. Despite its importance, the quantity and quality of groundwater resources are increasingly threatened by both natural and anthropogenic factors. Climate change has potential impacts on groundwater such as seawater intrusion and groundwater level decline6,7. Additionally, a rapid increase in water demand and contamination from anthropogenic pollutants such as fertilizer residue and untreated wastewater has worsened this water source4,5,8.

Similar to other areas, the groundwater in the Mekong Delta of Vietnam also faces the depletion of quantity and decline of quality from anthropogenic stressors. In recent decades, the river and canal systems in this region have been highly polluted by agricultural, aquacultural, domestic, and industrial wastes9. According to the study in Hau River10, one of the sources for this contamination is the untreated wastewater from rice intensification and domestic sewage pipes that was directly discharged into surface water bodies. This has forced people in the Mekong delta area to be more reliant on the groundwater source. According to Erban et al.11, the groundwater level in the region has annually declined with an average of approximately 0.3 m/year. Previous studies have reported various groundwater pollutants in the area including arsenic contamination in An Giang province12,13, ammonium contamination in Bac Lieu province14, and nitrate and chloride contamination in Soc Trang province8. Thus, it is imperative to continuously evaluate the groundwater quality in this region to ensure that its quality can serve human demands. Groundwater quality assessment is commonly conducted by comparing each key parameter with the drinking water standard limit. However, determining each water quality parameter independently can lead to the lack of the correlation among related parameters into the groundwater quality assessment.

For a better interpretation of the data, multivariate statistical approaches such as cluster analysis (CA) and principal component analysis (PCA) are necessary to provide more insights into water quality assessment. PCA can be applied to identify the key parameters correlated with the water quality and to determine potential pollution sources while CA is generally used to group sampling sites with the commonalities of water quality properties14,15,16. Furthermore, water quality index (WQI) has been widely employed to assess water quality and its suitability for drinking purposes5,9,16,17. One limitation of this method is the subjective weights of parameters that are generally defined by experts based on the importance of these corresponding parameters. This can lead to different weighting values for a single parameter, and any small changes will significantly affect the final WQI values. For this reason, an integrated-weight method that combines information entropy and Criteria Importance Through Inter-criteria Correlation (CRITIC) method has been applied to generate more accurate and objective WQI values and to overcome the subjective issue of the traditional approach by considering the conflict between each parameter’s correlation and the changes in data size and value5,18,19.

Therefore, this study aims to evaluate the groundwater quality in the Mekong Delta area by CA and PCA, and to determine the quality for drinking purposes by integrated-weight water quality index (IWQI) methods. The findings from this research not only demonstrate the importance of applying statistical techniques in groundwater quality assessment but also provide the drinking-quality status of the groundwater in the Mekong Delta region, which is significant to further groundwater resource management.

Materials and methods

Study area description

The study area includes two provinces (An Giang and Dong Thap) and one city (Can Tho city) of Vietnam, which are located within the Mekong Delta area. The study area is in tropical monsoon climate with two distinct seasons: the rainy season (May–October) and the dry season (November–April). In Mekong Delta, the groundwater exploitation mainly occurs in the Middle-Upper Pleistocene (qp23), Lower Pleistocene (qp1) and Upper Miocene (n13) layers. Each area description is provided below.

An Giang province

An Giang province is situated in the southwest of Vietnam (10° 12ʹ N to 10° 57ʹ N and 104° 46ʹ E to 105° 35ʹ E) and borders with Cambodia in the northwest. The east and northeast borders Dong Thap province while the south and southeast borders Can Tho city. With the area of 353,668.02 ha, the terrain of An Giang is divided into two typical types: 87% of plain terrain and 13% of mountainous areas. This has facilitated the area as an important agricultural center of Vietnam, particularly rice production. According to the land use map of An Giang province in 2018, the ratio of triple and double rice crops accounted for 46.6% and 24.7%, respectively16. Using surface water for dike irrigation systems has caused the deterioration of surface water and groundwater quality in the region16,20.

Can Tho city

The key economic region of Mekong Delta comprises not only An Giang province but also Can Tho city. With the area of 1439.2 km2, Can Tho city is the largest city in the Mekong Delta. The terrain is relatively flat and suitable for rice and fishery production. Surface water pollution is a current issue of the city where wastes from domestic and aquacultural activities and food processing factories are the main pollution sources; therefore, groundwater has become an important freshwater source to meet daily demands21.

Dong Thap province

Dong Thap province has a total area of 3384 km2, comprising 2602 km2 of agricultural land, 111 km2 of forest land, 257 km2 of special-use land, and 146 km2 of residential land. It has aquaculture and agriculture as the key sectors for its economic development.

Groundwater sampling and analysis

Groundwater sampling collection and analysis were performed by the Center for Natural Resources and Environment Monitoring of An Giang province, Dong Thap province, and Can Tho city (Vietnam), in which the Ministry of Natural Resources and Environment (Vietnam) has granted a certificate of eligibility to operate environmental monitoring services. In 2019, groundwater samples were collected twice, one time in March and one time September, from 64 sampling sites along the Mekong River in Can Tho city (location denoted as CT1-CT27), An Giang province (location denoted as AG1–AG13) and Dong Thap province (location denoted as DT1-DT24). Figure 1 shows the sampling locations within study area of Mekong Delta, which is located within the Mekong River basin22. Water quality parameters including pH, total hardness, nitrate (NO3), iron (Fe), lead (Pb), mercury (Hg), arsenic (As), and coliforms were measured. Only pH was measured on-site with a portable pH meter (model: SensION + pH1, Hach, USA) while other parameters were measured in the laboratory using the standard methods23, which include the cadmium reduction method for NO3 (SMEWW 4500.NO3E:2012), the flame and electrothermal atomic absorption spectrometric method (SMEWW 3112.B:2012 and SMEWW 3113.B:2012) for heavy metals (Fe, Pb, Hg and As), and the multiple-tube method (TCVN 6178-2:1996 (ISO 9308-2:1990 (E))) for coliforms.

Figure 1
figure 1

Map of groundwater sampling sites in the Mekong Delta of Vietnam (AG1-AG13 are sampling sites in An Giang province, DT1-DT24 are sampling sites in Dong Thap province, and CT1-CT27 are sampling sites in Can Tho city) located within the Mekong River basin. The map containing the sampling locations (left) was constructed by the software QGIS version 3.14 (https://qgis.org/en/site/forusers/download.html) licensed under GNU General Public License (CC BY-SA 3.0). The modification of the map was made directly in the OpenStreetMap data layer from the QGIS database. The Mekong River basin map (right) has been modified from Mekong River Commission (2020)22 licensed under CC BY 4.0.

Multivariate statistical analysis

CA using the Ward’s linkage method was conducted to group groundwater sampling sites based on their similarity in terms of water quality. The clusters are formed based on the Euclidean distance calculated by (Dlink/Dmax) × 100, in which Dlink is the linkage distance for an individual case and Dmax is the maximum linkage distance. The number of statistically significant clusters is selected by considering the groups such that (Dlink/Dmax) × 100 < 6024. Z-score standardization was used to eliminate the effect of different units of water quality parameters. PCA was employed to extract key parameters that drive the variation of groundwater quality in this study. This method forms a new set of parameters known as principal component (PC)9,13. PC was selected based on its eigenvalue and percentage contribution to the total variation. Both CA and PCA were performed using the ORIGIN version 2019b (OriginLab, USA).

Integrated-weight water quality index (IWQI)

Entropy-weighted method

The method of information entropy to calculate weighting parameters was from Shannon25. The calculation of entropy weights of parameters was conducted as follows:

Firstly, the initial data were arranged as Eq. (1):

$$X = \left[ {\begin{array}{*{20}c} {{\text{x}}_{11} } & \cdots & {{\text{x}}_{{{\text{1n}}}} } \\ \vdots & \ddots & \vdots \\ {{\text{x}}_{{{\text{m1}}}} } & \cdots & {{\text{x}}_{{{\text{mn}}}} } \\ \end{array} } \right]$$
(1)

where, m is the number of groundwater sampling sites (i = 1, 2, …, m), n is the number of water quality parameters (j = 1, 2, …, n).

Since the units of water quality parameters were different, the value of each parameter was standardized as expressed in Eq. (2):

$${\text{y}}_{{{\text{ij}}}} = \frac{{{\text{x}}_{{{\text{ij}}}} - \left( {{\text{x}}_{{{\text{ij}}}} } \right)_{\min } }}{{\left( {{\text{x}}_{{{\text{ij}}}} } \right)_{\max } - \left( {{\text{x}}_{{{\text{ij}}}} } \right)_{\min } }}$$
(2)

where, xij is the jth analyzed groundwater parameter of ith groundwater sampling site.

Then, the standardized data were organized as Eq. (3):

$${\text{Y}} = \left[ {\begin{array}{*{20}c} {{\text{y}}_{11} } & \cdots & {{\text{y}}_{{{\text{1n}}}} } \\ \vdots & \ddots & \vdots \\ {{\text{y}}_{{{\text{m1}}}} } & \cdots & {{\text{y}}_{{{\text{mn}}}} } \\ \end{array} } \right]$$
(3)

Information entropy (ej) was computed according to Eqs. (4) and (5):

$${\text{e}}_{{\text{j}}} = - \frac{1}{{{\text{lnm}}}}\mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{m}}} {\text{f}}_{{{\text{ij}}}} {\text{lnf}}_{{{\text{ij}}}}$$
(4)
$${\text{f}}_{{{\text{ij}}}} = \frac{{{\text{y}}_{{{\text{ij}}}} + 10^{ - 4} }}{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{m}}} \left( {{\text{y}}_{{{\text{ij}}}} + 10^{ - 4} } \right)}}$$
(5)

where ej is the information entropy of the jth parameter. 10–4 is used to ensure the formula is meaningful. m is the number of groundwater sampling sites.

Finally, the entropy weight (Wj1) was calculated as follows:

$${\text{W}}_{{{\text{j1}}}} = \frac{{1 - {\text{e}}_{{\text{j}}} }}{{\mathop \sum \nolimits_{{{\text{j}} = 1}}^{{\text{n}}} \left( {1 - {\text{e}}_{{\text{j}}} } \right)}}$$
(6)

CRITIC-weighted method

The CRITIC-weighted method was applied according to Zhang et al.5, the weights (Wj2) of water quality parameters were computed by Eqs. (7)–(9):

$${\text{W}}_{{{\text{j2}}}} = \frac{{{\text{s}}_{{\text{j}}} }}{{\mathop \sum \nolimits_{{{\text{j}} = 1}}^{{\text{m}}} {\text{s}}_{{\text{j}}} }}$$
(7)
$${\text{s}}_{{\text{j}}} = \delta_{{\text{j}}} \mathop \sum \limits_{{{\text{j}} = 1}}^{{\text{m}}} (1 - {\text{r}}_{{{\text{ij}}}} )$$
(8)
$${\text{r}}_{{{\text{ij}}}} = \frac{{\sum ({\text{y}}_{{{\text{ij}}}} - \overline{{{\text{y}}_{{{\text{ij}}}} }} )({\text{y}}_{{{\text{ij}}}}^{^{\prime}} - \overline{{{\text{y}}_{{{\text{ij}}}}^{^{\prime}} }} )}}{{\sqrt {\sum ({\text{y}}_{{{\text{ij}}}} - \overline{{{\text{y}}_{{{\text{ij}}}} }} )^{2} \sum ({\text{y}}_{{{\text{ij}}}}^{^{\prime}} - \overline{{{\text{y}}_{{{\text{ij}}}}^{^{\prime}} }} )^{2} } }}$$
(9)

where, Wj2 represents the weight of the jth water quality parameter, sj is the quantity of information calculated for the jth parameter, δj is the standard deviation of jth parameter, \({\text{r}}_{\text{ij}}\) is the correlation coefficient, \(\overline{{\mathrm{y} }_{\text{ij}}}\) is the average value of the parameter \({\text{y}}_{\text{ij}}\), and \({\mathrm{y}}_{\mathrm{ij}}^{\mathrm{^{\prime}}}\) is the mean value of the parameter \({\mathrm{y}}_{\mathrm{ij}}^{\mathrm{^{\prime}}}\). The values of \({\text{y}}_{\text{ij}}\) and \({\mathrm{y}}_{\mathrm{ij}}^{\mathrm{^{\prime}}}\) were normalized by Eq. (2).

Integrated weight calculation

The integrated weights (Wj) can be calculated by combining the entropy weights and CRITIC weights5, which are expressed in Eqs. (10)–(12):

$${\text{W}}_{{\text{j}}} = {\text{ pW}}_{{{\text{j1}}}} + (1 - {\text{p}}){\text{W}}_{{{\text{j2}}}}$$
(10)
$${\text{p }} = \mathop \sum \limits_{{{\text{j}} = 1}}^{{\text{m}}} \left[ {\left( {{\text{w}}_{{\text{j}}} - {\text{W}}_{{{\text{j1}}}} } \right)^{2} + \left( {{\text{w}}_{{\text{j}}} - {\text{W}}_{{{\text{j2}}}} } \right)^{2} } \right]$$
(11)
$${\text{w}}_{{\text{j}}} = \frac{{{\text{W}}_{{{\text{j1}}}} \times {\text{W}}_{{{\text{j2}}}} }}{{\mathop \sum \nolimits_{j = 1}^{m} ({\text{W}}_{{{\text{j1}}}} \times {\text{W}}_{{{\text{j2}}}} )}}$$
(12)

where, p is the preference coefficient. The results of integrated weights of water quality parameters are given in Table 1.

Table 1 Vietnamese groundwater quality standard for water quality parameters used in this study and their relative weights based on the integrated weight calculation.

Integrated-weight water quality index (IWQI)

The IWQI values were obtained as follows:

$${\text{IWQI }} = \mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{n}}} {\text{W}}_{{\text{j}}} {\text{Q}}_{{\text{j}}}$$
(13)
$${\text{Q}}_{{\text{j}}} = 100 \times \frac{{{\text{V}}_{{\text{o}}} - {\text{V}}_{{\text{j}}} }}{{{\text{S}}_{{\text{n}}} - {\text{V}}_{{\text{j}}} }}$$
(14)

where n is the number of parameters, Wj is relative weight for the jth parameter (Table 1), Qj: the quality rating of the jth parameter, Vo: the observed value of jth parameter at a certain monitoring site; Vj: the ideal values which are considered “0” for drinking water except pH2,17. In the case of pH, Vj is 7.0 (neutral pH) and Sn is 8.5. Vietnamese groundwater quality standard values were used as a reference in this study (Table 1)26.

The groundwater quality is categorized into five classes based on the calculation of IWQI: excellent (the values ranged from 0 to 50), good (50 to 100), poor (100 to 200), very poor (200 to 300), and unsuitable for drinking (> 300)5,16. Based on the values of IWQI, the spatial distribution of groundwater categories was presented using the using the QGIS version 3.14 software (https://qgis.org/en/site/forusers/download.html). This was based on interpolation with the inverse distance weighted method. The sample points were weighted during interpolation, and the five classes were assigned five different colors.

Results and discussion

Hydrochemical and microbiological parameters of groundwater in the Mekong Delta

The groundwater quality data in the Mekong Delta study area are presented in Table 2 and Fig. 2. The average parameter values were compared with the Vietnamese regulation on groundwater quality QCVN 09-MT:2015/BTNMT26. The pH values of the water samples varied from 6.88 to 7.75 with an average of 7.29 ± 0.22 (Fig. 2a), which were still within the Vietnamese standard (5.5–8.5). The pH data observed in this study were similar to the ranges observed in the area by previous studies. During 2009–2018, the average pH value of groundwater samples collected in An Giang province was reported in the range of 6.7–7.2 in the dry season and 6.5–6.9 in the rainy season13. pH values in Bac Lieu province were in the range of 7.16–8.209. The difference between the pH variation observed in An Giang and the one observed in Bac Lieu could be attributed to different anthropogenic pollution in both areas since the middle-upper Pleistocene aquifer was highly exploited by industrial and household uses27.

Table 2 Groundwater quality in An Giang province, Dong Thap province, and Can Tho city in 2019.
Figure 2
figure 2

Box and whisker plot of hydrochemical and microbiological parameters in groundwater samples: pH (a), total hardness (b), nitrate (c), Fe (d), Pb (e), Hg (f), As (g), and coliforms (h). The outliers of the data are presented by the dots with their sampling locations (AG An Giang province, DT Dong Thap province, CT Can Tho city).

Total hardness in groundwater samples indicates high cation concentration, e.g., calcium and magnesium. In this study, the total hardness ranged between 23.6 and 999.7 mg/L, with an average of 231.2 ± 227.4 mg/L (Fig. 2b). Nine out of 64 collected samples (14.1%) exceeded the limit of total hardness for groundwater (500 mg/L). These sampling sites are located in An Giang province (AG1, AG3, AG4, AG7, and AG9 sites) and in Dong Thap province (DT2, DT8, DT9, and DT21 sites). Consumption of these groundwater sources without treatment can lead to health impacts. Comparing with nearby provinces observed within the close time period (2017–2020), the total hardness of groundwater observed in this study was within the range measured in Soc Trang province, which greatly fluctuated from 13 to 3080 mg/L8, and higher than the range found in Bac Lieu province, which was from 98 to 172 mg/L9.

The NO3 concentration in groundwater samples was ranging from below the detection limit (< 0.05 mg/L) to the maximum of 10.19 mg N/L with the average of 0.81 ± 1.34 mg N/L, which all were within 15 mg N/L of the Vietnamese standard (Fig. 2c). Comparable NO3 concentrations in groundwater have also been detected by the previously mentioned studies in Mekong delta with the sampling times during 2017–2020: 0.01–2.96 mg N/L in An Giang province28, 0.1–260 mg N/L in Soc Trang province8, and 0.41–1.91 mg N/L in Bac Lieu province9. Since Soc Trang province is located further in the south of this study area (adjacent to Hau Giang province in Fig. 1), and Bac Lieu province is located further in the south next to Soc Trang and Hau Giang provinces, it could be implied that the groundwater in this area of the Mekong Delta has been contaminated with mild NO3 pollution during the sampling period. An increase in NO3 concentration in groundwater could be influenced by the impacts from domestic and industrial wastewater, agricultural runoff, and excessive fertilizer application3,17,29. Previous studies showed various agricultural activities in these provinces such as rice and fruit production30,31,32; therefore, sources of nitrate pollution could be from these agricultural activities in the Mekong Delta area.

Heavy metals in groundwater samples ranged from below the detection limits (< 0.1 mg/L of Fe, < 1 \(\times\) 10–3 mg/L of As and Pb, and < 3 \(\times\) 10–4 mg/L of Hg) to 35.5 mg/L of Fe, 6.50 \(\times\) 10–3 mg/L of Pb, 4.40 \(\times\) 10–4 of Hg, and 1.98 \(\times\) 10–2 of As (Fig. 2d–g). The concentrations of these metals detected in groundwater were within the permissible limits except for Fe. Three samples, including AG1, AG11, and CT9, exceeded the limit of Fe in groundwater (5 mg/L) with concentrations of 5.55, 7.63, and 35.46 mg/L, respectively (Fig. 2d). The Fe concentrations observed in An Giang province in this study were much higher than the value observed in Bac Lieu province (1 mg/L), which is located further to the south of Can Tho city9. Although the As concentration in this study is within the standard, it was higher than that observed in the nearby Bac Lieu province. Previous studies have reported serious As contamination in An Giang province for over a decade12,15,33. With the development of hydropower plant construction in the upper Mekong River and the scarcity of surface water, the increasing use of As-contaminating groundwater is unavoidable13,34,35. High concentrations of heavy metal ions in groundwater are normally associated with both natural processes (e.g., water–rock interaction) and human activities (e.g., improper-treated industrial wastewater). The detection of heavy metals in groundwater in this study could also be from the excessive groundwater extraction to serve domestic and irrigation activities, which led to lower water levels and stronger reduction condition that triggers the release of heavy metals into aquifers36.

High coliform density was observed in the groundwater samples, which was highest at 761.5 MPN/100 mL with an average value of 33.93 ± 102.53 MPN/100 mL (Fig. 2h). There were 52 groundwater samples (81.3% of total samples) exceeding the standard limit of coliforms (3 MPN/100 mL). It is considered that fecal contamination in groundwater is ubiquitous in the Mekong Delta due to leaking fecal matter from pit latrines, livestock wastewater, and wild animal droppings via improperly protected wells, in which the coliform density can greatly vary from 9 – 9,300 MPN/100 mL13,28,37.

Cluster analysis and principal component analysis of groundwater samples

According to CA, 64 groundwater sampling sites were grouped into four clusters based on hydrochemical and microbiological parameters (Euclid distance = 20)24 as illustrated in Fig. 3. The groundwater quality characteristics of each cluster are summarized in Table 3. The cluster results also revealed that the groundwater quality in the study area depends on geological locations. Cluster I includes 15 samples (AG1-4, AG6-13, DT2, DT8, DT9), which was highly polluted with coliforms. Groundwater samples in this cluster also showed slight total hardness contamination. However, the hardness of cluster I in Table 3 is a lot higher than other clusters. Most of the samples in Cluster I are located in An Giang province and some in Dong Thap province. CT9 is the only sample classified into Cluster II, which was more polluted with Fe than other clusters. Cluster III includes most of the groundwater samples from Can Tho city from the total of 27 samples (AG5, CT1-8, CT10-27). Lastly, the remaining groundwater samples collected in Dong Thap province (DT1, DT3-7, DT10-24) are grouped into Cluster IV. As same as Cluster I, Cluster II to IV were detected with coliform contamination in groundwater.

Figure 3
figure 3

Dendrogram grouping groundwater samples based on hydrochemical and microbiological parameters. Each sample is described on x-axis and Euclid distance is represented on y-axis.

Table 3 Water quality characteristics of each cluster obtained from hierarchical cluster analysis.

The principal components responsible for the groundwater quality variation in the study area were extracted from the PCA results (Table 4). Three principal components (PC1-3) were extracted based on their eigenvalues higher than 1.0, which could explain 77.2% of the cumulative variation. PC1 accounted for 36.9% of the total variation and was recognized by moderate negative correlation with Pb, Hg, and As. This suggests the pollution from water–rock interactions and anthropogenic sources like industrial wastewater and landfill leachate, which leads to the dissolution of metallic compounds into groundwater9,13. Can Tho city is the largest city in the Mekong Delta and is well-known for its commercial, cultural, and industrialized activities while small-scale industries such as craft production can be found in Dong Thap province, leading to the contamination of heavy metals in the surface water and soil that can be leached to groundwater38,39. PC2 was associated with a moderate positive correlation with NO3 and Fe, accounting for 23.0% of the total variation. Anthropogenic sources such as excessive fertilizer use, industrial and domestic wastewater, agricultural runoff, and aquacultural wastewater are the sources of NO3 pollution3,17,29. Rice production has been reported as the dominant agricultural activity in An Giang province30 while rice starch production, fish pond, and livestock farms could be found in Dong Thap province39. These activities could then contribute to the presence of NO3 in groundwater of the study area. Since Fe can act as an electron donor in the denitrification process40, high Fe concentration in the groundwater samples can possibly reduce NO3 concentration. The third principal component, PC3, could explain 17.3% of the total variation and was characterized by a moderate positive correlation with total hardness and coliforms. Industrial wastes and natural hydrogeochemical activities are primary sources of calcium and magnesium cations41, which significantly contribute to the total hardness of groundwater. Moreover, according to the analysis, the groundwater in the study area is also susceptible to pathogenic contamination. High coliform density is associated with leaking contaminants from fecal sources such as pit latrines, sewage pipes, or livestock wastes from the farms in the study area13,28,37,39.

Table 4 Loading values of water quality parameters in each principal component.

Integrated-weight groundwater quality index for groundwater classification

The results of IWQI calculation and groundwater quality classification for drinking purposes are presented in Table 5. The IWQI values varied from 4 to 2761 with a mean of 139 ± 370. Out of 64 groundwater samples, over half (53.1%) were classified as excellent for human consumption. There were 16 (25%), 6 (9.4%), and 3 (4.7%) samples categorized as good, poor, and very poor water quality for drinking purposes, respectively. In addition, 5 samples including AG2, AG4, AG12, DT9, and DT15 had the IWQI values over 300, which were considered unsuitable for any drinking purposes. One of the reasons for groundwater having inadequate quality for human consumption is high coliform density. The previous CA results showed that all clusters were contaminated with coliforms. This was also proved by the IWQI results, which showed a high positive correlation coefficient (0.99) with coliform (Table 6). Even though As, Hg, and Pb were highly correlated with each other, these parameters were within the allowable limits. Besides, the remaining parameters have no correlation or very weak correlation with each other. Therefore, it might be deduced that changes in groundwater quality in the study area were significantly attributable to coliform density.

Table 5 IWQI values of sample sites and corresponding groundwater categories (Rank I: excellent, Rank II: good, Rank III: poor, Rank IV: very poor, and Rank V: unsuitable for drinking).
Table 6 Correlation matrix between water quality parameters and overall IWQI values.

Based on the interpolation of IWQI values with inverse distance weighting, the distribution map of groundwater quality categories for drinking purposes were constructed (Fig. 4). It can be seen that most of the samples classified unsuitable for drinking (AG2, DT15, AG4, DT9) are located near large rivers. This could imply that the groundwater quality in these areas depends on the surface water quality because contaminants can directly infiltrate into the aquifers. Therefore, agricultural runoff and the discharge of untreated or improperly treated industrial, domestic, and aquacultural wastewater into surface water could consequently contaminate the groundwater in the region. It was found that surface water and groundwater in An Giang province was heavily polluted with Escherichia coli and coliforms due to the emergence of burial swine pits in 201928; thus, the leachate containing pathogens from these sites could leak into surrounding water bodies. In this study, the majority of groundwater samples with lower IWQI values were recorded in the east and southeast parts of the study area where these sampling points are located far from the burial pits. It has been shown that the fine-grained sediments separating the water table from the vadose zone can reduce the transport of microorganisms into groundwater42, which could prevent the transfer of E. coli and coliforms from the burial pits to the sampling locations. However, since microorganisms may not be degraded by this process, the continuous flow of water in the subsurface will eventually transport them to the area43. Therefore, lower IWQI values observed in An Giang area could be the result of slow transport of coliforms. From the findings, it can be implied that groundwater quality in this area was continuously affected by human activities, and it cannot be used as a safe freshwater resource for human consumption.

Figure 4
figure 4

Spatial distribution map of groundwater classification for drinking purposes based on the IWQI values. The map was generated by the software QGIS version 3.14 (https://qgis.org/en/site/forusers/download.html) licensed under GNU General Public License (CC BY-SA 3.0).

Conclusion

This study evaluated groundwater quality in the Mekong Delta of Vietnam using multivariate statistical methods (CA and PCA) and IWQI. The results showed that the values of pH, NO3, Hg, As, and Pb in 64 sampling sites were within the Vietnamese standard. Meanwhile, total hardness and Fe concentrations in some sites exceeded the groundwater quality standard, and high coliform contamination was detected at over 80% of total samples. Sampling sites were classified into 4 clusters according to the differences in water quality parameters and geological characteristics. The results of PCA revealed that 3 PCs could explain 77.2% of the total variation of groundwater quality in this study. PC1 showed moderate correlation with Pb, Hg, and As while PC2 and PC3 had moderate correlation with NO3 and Fe, and total hardness and coliforms, respectively. Even though a very strong correlation was observed between IWQI values and coliforms, more than 50% of the sampling sites still contain groundwater with excellent quality for drinking purposes. The results of this study have partially evaluated the status of groundwater quality in the Mekong Delta area that can support policymakers in developing future strategies for water treatment and water resource management.