Introduction

Over the past decades, the impact of human activities such as the exploitation of mineral resources1, agricultural irrigation2, industrial development3, tourism and urban infrastructure4,5, and road transportation6 has led to a significant enrichment in heavy metals (HMs) in soils. Excessive amounts of HMs in the soil environment can disrupt soil functions7 and cause risks to human health through ingestion, inhalation and dermal contact8. For example, Pb, Cd and Hg can cause damage to the human nervous system and kidneys, Cu can cause neurological problems and liver disease, and Ni can inhibit the development of immune organs by inducing excessive apoptosis and inhibiting cell proliferation9. The risks posed by HMs to human health and terrestrial ecosystems have attracted widespread attention, and HMs have been listed as priority pollutants for monitoring10. Therefore, a systematic study of HMs in the soil environment is of great importance for maintaining ecological security and human health.

Mountains are the starting point for materials circulation and energy flow in global ecosystems, and they are sensitive to global climate and environmental change11. Due to the vulnerability and sensitivity of the mountain ecological environment in arid areas, mountain systems are more likely to respond to external environmental changes, and soil contamination in mountain areas by HMs can reduce the service functions and values of mountain ecosystems and even pose threats to human health12. The rapid urbanization and industrialization that has occurred in recent decades have accelerated the accumulation of HMs in the environment13. Although mountainous areas are far from urban industrial centers that release HMs, the barrier effect and condensation effect of mountainous areas induce the deposition of HMs discharged from industries into mountainous soils through long-distance atmospheric transport14, making environmental safety issues increasingly prominent15. Scientific publications have shown that the development of transportation16 and tourism17,18 has also become a key factor in the enrichment of HMs in some mountainous areas. These results demonstrate that the sources of HMs in mountain soils are diverse; therefore, quantitative and qualitative analyses of the sources of soil HMs are particularly significant for precisely controlling pollution sources19. Presently, many quantitative methods are widely applied to identify sources of soil HMs, such as the PMF model20, principal component model analysis/multiple linear regression (PCA/MLR)21, UNMIX model22, and chemical mass balance method (CMB)23. Qualitative methods such as PCA, correlation analysis and cluster analysis cannot provide the contributions of pollution sources to HMs. Additionally, the stable isotopic compositions of HMs have an excellent tracing capacity, and stable isotopes of Pb, Zn, Cd, and Cu have been widely used for contamination source identification23,24. In the above approaches, PMF is a receptor model recommended by the U.S. Environmental Protection Agency (US EPA) for pollutant source apportionment, which takes into account the uncertainty of the data matrix and provides contributions of pollutant sources to HMs under nonnegative constraints25. The PMF model has been applied to source apportionment research of pollutants in the atmosphere, sediment and soils26,27 and has achieved positive results. Receptor models combined with isotopes are expected to be an efficient way to identify HM sources when multiple HMs coexist.

The Sayram Lake Basin (Fig.1) is located in the middle of the West Tianshan Mountains in Xinjiang, which is a typical, closed watershed of the Tianshan Mountains. In recent decades, with economic development, tourism and transportation industries have rapidly emerged. As a 5A-level tourist attraction, the tourism area of Sayram Lake receives 2,530,000 tourists annually (http://www.xjboz.gov.cn/), which may cause high-intensity traffic problems and generate large amounts of tourist wastes, which lead to negative effects on the environment. However, these problems have not received sufficient attention. Existing studies on the Tianshan Mountains have focused on climate change and water resources28,29, land use and land cover30, snow and glaciers31, and paleolimnology32. The lack of research on the concentration, contamination and eco-risk characteristics of HMs and their influencing factors in the Tianshan Mountains has significantly affected the development of agriculture and pastoralism, tourism value evaluations and identifications of the ecological security of this region.

Figure 1
figure 1

Location map and distribution of sampling points for the Sayram Lake Basin. The graphs (a, b) and (c) were generated by QGIS 3.26.3 (https://www.qgis.org) with the global vector data from the GADM database (https://gadm.org), the DEM data from the Natural Earth database (https://www.naturalearthdata. com), and the land use data from the Resource and Environmental Science and Data Center database (https://www.resdc.cn). The combination of graphs (a, b) and (c) was accomplished with linkscape 1.2.1 (https://inkscape.org).

Thus, an investigation was carried out on the topsoil of the tourist area of Sayram Lake. The main objectives of this work were (1) to reveal the concentration characteristics of HMs; (2) to assess the degree of contamination and eco-risk of HMs; and (3) to identify the potential sources and contributions of soil HMs based on the PMF model and Pb isotope ratios. The results of the study are expected to provide scientific support for the prevention and control of HM pollution, for the sustainable development of the ecological environment and for the evaluation of ecological service values of mountain areas.

Results

Whole-rock geochemical compositions and grain-size characteristics

The whole-rock composition results are shown in Table S1, and the statistical characteristics are shown in Fig. 2. In general, the concentration range (%) and mean value (%) of the whole-rock composition in the topsoil of the tourist area of Sayram Lake decreased in the following order: SiO2 (34.8–62.09, 51.69) > LOI1000 (8.57–25.83, 16.02) > Al2O3 (7.99–15.18, 12.33) > CaO (1.71–21.10, 7.35) > SOC (2.61–10.50, 5.17) > Fe2O3 (1.6–3.3, 4.8) > K2O (1.6–3.33, 2.50) > MgO (1.85–3.57, 2.44) > Na2O (0.79–2.51, 1.61) > TiO2 (0.41–0.72, 0.61) > SO3 (0.10–0.73, 0.23) > P2O5 (0.14–0.35, 0.20) > MnO (0.06–0.15, 0.11).

Figure 2
figure 2

Whole-rock composition results of each sampling point in in the tourist area of Sayram Lake.

The grain-size composition characteristics of the topsoil samples are shown in Fig. 3. According to the Udden-Wentworth classification33, the soil grain size was divided into five classes: clay (< 4 μm), fine-silty (4–16 μm), silty (16–32 μm), coarse-silty (32–64 μm) and sandy (> 64 μm). The percentage of grain-size composition of each level from large to small was fine-silty (36.49%) > clay (24.64%) > silty (19.98%) > coarse-silty (12.11%) > sandy (6.78%).

Figure 3
figure 3

The grain-size composition of the topsoil in the tourist area of Sayram Lake.

Heavy-metal concentrations and Pb isotope ratios

The descriptive statistics for Cr, Cu, Ni, Pb, Zn and Cd in the topsoil of the tourist area of Sayram Lake are shown in Table S2, and the statistical characteristics are shown in Fig. 4. The highest concentration of HM was that of Zn, which ranged from 62–141 mg/kg, and the mean value was 101 mg/kg. The lowest concentration of HM was that of Cd, which ranged from 0.2–1.08 mg/kg, with an average value of 0.34 mg/kg. The average concentrations of Cr, Cu, Ni and Pb were 53.5 mg/kg, 27.48 mg/kg, 27.89 mg/kg and 24.59 mg/kg, respectively. Additionally, the average Pb and Zn concentrations exceeded the background values by 33% and 50%, respectively. The mean concentration of Cd was 2.8 times that of the background value (0.12 mg/kg). Furthermore, the CVs of Cd, Pb and Zn were 0.58, 0.26 and 0.2, respectively, while the CVs of the other elements were less than 0.2. These results demonstrated that Cd, Pb and Zn had strong variability in their mean values, and the variability in Cd was most significant.

Figure 4
figure 4

Concentrations of HMs in the topsoil of the study area.

The descriptive statistics of the Pb isotope ratios (206Pb/207Pb and 208Pb/206Pb) in the topsoil of the tourist area of Sayram Lake are shown in Table S2. The results showed that 206Pb/207Pb ranged from 1.09 to 1.21, with an average value of 1.15. 208Pb/206Pb ranged from 2.01 to 2.18, with an average value of 2.11.

Contamination and eco-risk evaluation of HMs

The accumulation of HMs in soils may have a negative impact on the ecological environment. Therefore, researchers evaluate the effects of HMs on the environment based on a certain system of evaluation criteria and corresponding mathematical models. A single evaluation method may lead to errors in the results, so multiple evaluation criteria (Igeo, PERI and PLI) were applied to evaluate the topsoil in the tourist area of Sayram Lake to strengthen the credibility of the assessment results.

The average Igeo values of HMs in the topsoil of the tourist area of Sayram Lake decreased in the following order: Cd (0.80) > Zn (−0.03) > Cu (−0.19) > Pb (−0.21) > Ni (−0.44) > Cr (−0.70). From the average value of Igeo, the area was slightly contaminated with Cd (0.8), and the average value of Igeo of the other elements was less than 0, showing no contamination. From a more microscopic point of view, 22.2% of all sample points were moderately Cd-contaminated, these points were concentrated in the southeast of the study area (ss04, ss05, ss06, ss07), and the rest of the sample points were slightly contaminated. For Pb, there were two sample points with moderate contamination, and the Igeo of the remaining points was less than 0. For Zn, 39% of the samples were slightly contaminated. The Igeo values of Cr, Ni and Cu were less than 0, indicating that these HMs caused no contamination of the tourist area of Sayram Lake.

Eir can indicate the risk of individual HMs. Referencing the grading criteria for Eir, Cr, Cu, Ni, Pb and Zn showed values of Eir that did not exceed 40, indicating that they posed low risks. The mean Eir of Cd was 85.69, which indicated a considerable risk. The PERI of Cd ranged from 67.39 to 295.80, with only one sample site (ss04) having a PERI (295.80) greater than 150, which was a moderate ecological risk area. The PLI ranged from 1.04 to 1.89, suggesting a moderate contamination level in the tourist area of Sayram Lake.

Discussion

The plots of Igeo, PERI, and PLI of HMs in the topsoil of the tourist area of Sayram Lake (Fig. 5) reveal the degree of HM pollution and eco-risk in this study area on the one hand and, on the other hand, indicate the direction for the relevant agencies to target soil environmental protection and HM pollution prevention and control measures. In this study, the Igeo results showed that Cd was the most highly enriched HM, and Pb, Zn, Cd, and Ni were slightly enriched in a few sample sites. The unnatural accumulation of these elements is usually closely associated with human activities in the area34. Tourism is the main economic activity in the district, and published studies have reported that tourism infrastructure construction (e.g., roads, buildings, etc.) and tourism wastes (e.g., plastic bags, batteries, hotel wastewater) release Cd into the soil35. Additionally, the accumulation of Pb, Zn, Cu and Ni in soils is usually associated with traffic emissions36. The PERI showed that the study area was at low risk overall, with only point ss04 exhibiting medium risk; however, this result was caused by the abnormally high Cd concentration value (Fig. 4) at point ss04 (Cd (concentration): 1.08 mg/kg, Cd (background): 0.34 mg/kg). This anomalous concentration value has a large influence on the PERI calculated based on the measured concentration, the background value and the toxicity coefficient. Therefore, references to this point can be appropriately removed when considering eco-risk. The PLI of each sampling point was greater than 1 and less than 2, which means that the area was in a moderately contaminated state. In general, the degree of soil HM contamination in this area was low; however, due to HM toxicity, bioaccumulation, and persistence37, the HM contamination of this area still requires sustained attention.

Figure 5
figure 5

Contamination and ecological risk indices: (a) geoaccumulation index (Igeo) of HMs; (b) ecological risk of individual HMs; (c) potential ecological risk index (PERI) of HMs; (d) pollution load index (PLI) of HMs.

Correlation analysis is an efficient way to reveal correlations among HMs through Pearson correlation coefficients, and HMs with significant correlations may originate from the same source38. As shown in Table S5, the elemental pairs Cd-Cu (p < 0.01), Cd-Ni (p < 0.01), Cd-Zn (p < 0.05), Ni-Cu (p < 0.01), Pb‒Zn (p < 0.01), Cr-Ni (p < 0.01), Cr-Pb (p < 0.05), Cr-Zn (p < 0.05), Zn-Ni (p < 0.05), Zn-Cu (p < 0.01) were significantly and positively correlated. These results suggest that the elemental group Cd-Cu-Ni-Zn can be considered the same source. The significant correlation of Pb‒Zn implies that they have homologous characteristics, and published studies have reported that Pb and Zn are usually enriched by traffic emissions20. However, Cr, Ni, Pb and Zn may also share similar sources, and this result is probably caused by mixed sources of HMs, since natural sources, human activities in the study area, or atmospheric deposition may contribute to HM concentrations. Conversely, Cd showed no correlation with Cr and Pb, which indicated that they were controlled by different sources. Therefore, the PMF model was employed to further identify the sources of HMs and to quantify the contribution of each source to each HM.

The HM concentration data file and uncertainty data file were used as input data according to the EPA PMF5.0 user's guide. To ensure the accuracy and reasonableness of the model results, the factors were set to 2, 3 and 4, and the random starting seed number of the default model was run 100 times. It was determined that the explanatory rate of the model was best with 3 factors. In addition, the coefficient of determination R2 for all observed and predicted values of HMs was greater than 0.7, with a minimum value of 0.74 and a maximum value of 0.99. Bootstrapping (BS) and displacement of factor elements (DISP) were employed to evaluate the bias and uncertainty of the PMF results39. The results showed that over 90% of the base factors were reproduced in the BS model, and no factor swaps were observed in the DISP model within the lowest maximum permitted change in Q (dQmax). Therefore, it is reasonable and valid for the PMF model to explain the information contained in the original data using 3 factors.

The PMF model source apportionment results are shown in Fig. 6, where Factor 1 has high loading values for Pb and Zn (39.7% and 32.0%, respectively), implying that they were influenced by the same sources. The mean concentrations of Pb and Zn exceeded the background values for the tourist area of Sayram Lake, while the CVs were 0.26 and 0.2, respectively, which were medium variation levels, indicating that they might be influenced by human activities40. In addition, the Igeo indicated the presence of slight level of Pb and Zn contamination at some sampling points in the tourist area of Sayram Lake, which suggests the existence of external sources of Pb and Zn. Published studies have shown that Pb emissions from vehicle exhaust account for two-thirds of total global Pb emissions41; despite the global ban on the production and use of leaded gasoline since 2000, vehicle exhaust emissions are still considered to be the main cause of Pb accumulation in soils42. Additionally, the wear and tear of brakes, bearings and tires of automobiles also promote the release of Pb and Zn into the soil environment43. The soil samples collected in this study were located in the tourist area of Sayram Lake, where tourist visits reach values of 2,530,000 per year and may contribute significantly to local traffic. The frequent vehicle traffic on the G30 highway, emissions from vehicle exhaust and the wear and tear of parts may promote the deposition of Pb and Zn in the topsoil. Therefore, Factor 1 was inferred to be a traffic source.

Figure 6
figure 6

Sources and contributions of HMs in the soil in the PMF model.

Factor 2 was dominated by Cr, Cu, Ni, Pb, and Zn, with loadings of 52.2%, 45.8%, 48.9%, 37.1%, and 36.1%, respectively. The average concentration of Cr was lower than the background value, and the average concentrations of Cu and Ni slightly exceeded the local background values. Furthermore, they have small coefficients of variation, indicating low influence from external pollution sources. Notably, Pb and Zn showed high loadings in both Factor 1 and Factor 2, indicating that Pb and Zn were from mixed sources and were controlled by both Factor 1 and Factor 2. Previous studies have shown that Cr, Cu and Ni in soils commonly originate from the soil parent materials and are controlled by the geological background material and soil-forming processes44. Many studies also support this view45,46. Therefore, Factor 2 was inferred to be a natural source.

Factor 3 was mainly characterized by Cd, Ni, Cu and Zn, with loadings of 71.1%, 31.0%, 29.9% and 31.9%, respectively. In this study, the mean concentration of Cd was 2.8 times the local soil background value, with a CV of 0.58, which was a strong level of variation in relation to the mean, indicating that Cd may be influenced by anthropogenic sources. According to previous studies, the accumulation of Cd in soils is usually associated with agricultural activities (sewage irrigation, pesticide spraying and fertilizer use)47, plating of automotive lubricants and brake pads48, coal combustion44 and industrial ore smelting49. In addition, researchers have found that tourism activities can also increase levels of HMs in soils. In a study conducted by Wang et al.50 on the effect of tourism activities on soil quality in the scenic area of Mount. Tai, Shandong Province, it was found that tourism activities accelerated the accumulation of HMs, with the average content of Cd exceeding the background value by 36.2%. Enrichment of HMs in soils under tourism loading conditions was also reported in another study, where local soils were highly contaminated with Cd4. An investigation of the study area showed that there were no mineral smelting activities or agricultural activities in the basin. The enrichment of Cd may originate from HM-containing wastes from tourism activities (batteries, plastics, wastewater, etc.) and atmospheric deposition (e.g., deposition from the wear and tear from motor vehicles and combustion of coal for heating). Therefore, Factor 3 can be considered a mixed effect of the above sources.

Figure S1 shows the contribution of three factors (F1: traffic source, F2: natural source, F3: tourist waste and atmospheric deposition) to HMs. The contributions of the three factors to Pb and Zn in the soil in a descending order were F1 ≈ F2 > F3. Cr, Cu and Ni were mainly influenced by F2. Cd was controlled by F3 (71.1%). Overall, it seems that the HMs of the topsoil in the tourist area of Sayram Lake were influenced by multiple sources, with the greatest contribution from F2 (38.5%), followed by F3 (34.3%) and F1 (27.2%). Notably, F3 has the greatest capacity to release Cd into the soil environment, and F3 control should be considered a priority.

To validate the results of the PMF model, the sources of Pb in the soil were further analyzed using the Pb isotope method. The Pb isotope ratio results confirmed that the Pb present in the topsoil of the tourist area of Sayram Lake was influenced by anthropogenic sources. As shown in Fig. 7, the Pb isotope ratios in soil samples from the tourist area of Sayram Lake were plotted with those in other related environments (206Pb/207Pb vs. 208Pb/206Pb). The Pb isotope ratio data of relevant environments mainly include Pb‒Zn ores from the Tianshan Mountains in Xinjiang51, vehicle exhaust emissions in China52,53, urban road dust in Xinjiang54 and dustfall in the Tianshan Mountains55. Figure 7 shows that there were significant differences in the Pb isotope ratios of different regions. The sampling points near G30 had higher 206Pb/207Pb ratios and lower 208Pb/206Pb ratios. The sampling points near side roads had lower 206Pb/207Pb ratios and higher 208Pb/206Pb ratios. The Pb isotope ratios at the sampling sites located near the G30 main road were similar to those of vehicle exhaust emissions, urban road dust and dustfall, and the results indicated that Pb enrichment was strongly related to traffic emissions. Increased human activity has accelerated the accumulation of HMs in the environment, and isotopic methods can identify the sources of HMs based on the similarities they exhibit14. By integrating the Pb isotope data in each environment, it was determined that the Pb in the soil of the tourist area of Sayram Lake was partially derived from traffic emissions. This result validated the plausibility of factor 1 in the PMF model being a traffic source.

Figure 7
figure 7

Comparison of Pb isotopic ratios in the topsoil with other relevant studies reported.

A comparison with other tourist areas (Table 1) yielded two distinctive findings: Pb, Zn, Cd and Cu are common pollutants that can easily accumulate in soils in tourism areas; the enrichment of HMs is strongly related to the type of local land use and human activities. Differences in human activities and the geographic contexts of each area lead to differences in the influencing factors and enrichment levels of HMs56. To safeguard the soil environmental, HMs that are easily enriched in tourist areas should be highlighted for monitoring, and anthropogenic sources around tourist areas should be reasonably controlled.

Table 1 Comparison of the main pollutants and sources in the topsoil of the Sayram Lake tourist area and other tourist research areas.

The material composition and structure of the soil influence the concentration of HMs in soils. To investigate the effect of soil composition and structure on HMs under natural conditions, data including whole-rock composition, grain size, soil organic carbon and loss-on-ignition were used for redundancy analysis with HMs. Figure 8(a) reveals the relationship between SiO2, LOI1000, Al2O3, CaO, Na2O, TiO2, SO3, P2O5, MnO, K2O, Fe2O3, MgO, SOC and Cd, Cr, Cu, Ni, Pb and Zn. The results showed that the content of HMs was related to SiO2, Al2O3, TiO2, P2O5, MnO, K2O, Fe2O3 and SOC. Among them, Pb was most closely related to Al2O3 and SOC, and previous studies have shown that soil organic matter has a strong adsorption capacity for HMs57. Zn, Cr, Cu and Ni were closely related to Fe–Mn minerals and silicates, which was similar to the results of Ma et al.58. Additionally, this also indicated that HMs were weakly affected by carbonate and sulfate. The relationship between HMs and grain size (Fig. 8b) showed that HMs tended to be stored in soil particles with grain sizes < 32 µm. Among them, Cd, Ni, Cu and Cr were associated with clay (< 4 µm) and silt (16–32 µm), while Pb and Zn were associated with a fine-silty (4–16 µm) grade.

Figure 8
figure 8

Redundancy analysis of HMs and whole-rock composition, soil organic carbon, and soil grain-size composition.

The findings revealed that the enrichment of Cd, Pb and Zn in the topsoil of the tourist area of Sayram Lake was influenced by anthropogenic sources. The results of the contamination and eco-risk evaluation suggest that the potential risks posed by the high Igeo and Eir values of Cd should be treated seriously by relevant authorities. Intensive visitors and traffic will promote the release of HMs that accumulate in the topsoil to contaminate the soil environment and weaken its functions. Tourism quotas have been suggested to reduce soil contamination and degradation17. However, for traffic emissions of HMs beyond individual or regional control (e.g., lead in gasoline, car brakes, brake pads and tires), authorities may change their approach and undertake soil remediation. For example, the hyperaccumulation function of plants can be used to reduce HMs in soils or appropriate amounts of soil conditioners can be added59 to reduce the polluting capacity of HMs by agglomerating them through adsorption and complexation effects.

In this study, the concentration characteristics, potential sources, contamination levels and eco-risk assessment of HMs were analyzed by collecting topsoil samples from the tourist area of Sayram Lake, but some limitations exist. First, a small number of topsoil samples were collected in this study; therefore, they might not adequately reflect the environmental status of the study area and reveal the spatial distribution characteristics of HMs. In addition, the objects of this study only involved Cd, Cr, Cu, Ni, Pb and Zn in the soil, and the absence of studies on other HMs (e.g., Hg, Co, W, etc.) and metalloids (As) has limitations for gaining an overall understanding of the soil environment in this study area. Additionally, HM source apportionment analysis should be combined with multiple source analysis methods to avoid contingency and subjectivity of the results. For example, quantitative analysis methods such as PCA-MLR and CMB can be combined to investigate potential sources and contributions of HMs, and isotope tracing methods for Cd, Cu and Zn can be used to reveal the sources of HMs more accurately. It is worth noting that there are some differences in the pollution evaluation results of the PERI and PLI methods that were caused by the different evaluation standards and thresholds adopted by the different methods. Therefore, the regional applicability of pollution evaluation methods is challenging, and subsequent studies on the toxicological effects of HMs in the region could be carried out to determine the pollution evaluation criteria and thresholds suitable for this region.

Conclusions

This study examined Cd, Cr, Cu, Ni, Pb and Zn in the topsoils of the Sayram Lake tourist area in Xinjiang, discussed their concentration characteristics, contamination levels, eco-risk characteristics and potential sources, and reached the following conclusions:

  1. 1.

    Cd, Pb and Zn were generally enriched in the topsoil, and their mean concentrations were 2.83, 1.33, and 1.5 times higher than the local background values, respectively. In contrast, the average concentrations of Cr, Cu and Ni were close to the corresponding background values.

  2. 2.

    The calculated Igeo and Eir indicated that the topsoil was more severely contaminated with Cd. The PERI suggested that the eco-risk in this study area was low, and the PLI in the tourist area of Sayram Lake ranged from 1.04 to 1.89, with a mean value of 1.39, indicating a moderate contamination level.

  3. 3.

    Three sources of soil HMs were identified and apportioned by combining the PMF model and Pb isotope ratios, namely, traffic sources (27.2%), natural sources (38.5%), and mixed sources (tourism waste and atmospheric deposition) (34.3%). The source apportionment results showed that Cr, Cu, and Ni were mainly derived from natural sources, Pb and Zn were attributed to traffic sources and natural sources, and Cd was associated with tourist wastes and atmospheric deposition.

  4. 4.

    The utilization of whole-rock composition and grain size data of the soils was an efficient way to study the influencing factors for HMs. The content of HMs in the topsoil of the tourist area of Sayram Lake was influenced by SiO2, Al2O3, TiO2, P2O5, MnO, K2O, Fe2O3 and SOC. Additionally, the content of HMs was also influenced by soil grain size, and HMs tended to be stored in soil particles with grain sizes < 32 µm.

Materials and methods

Geographical setting

The Sayram Lake Basin (Fig. 1) is located on the northern slope of the Tianshan Mountains and in the hinterland of the Eurasian continent. The basin belongs to Bole city, adjacent to Wenquan County, Horgos city, Huocheng County and Yining County, with a drainage area of 1408 km2 60. The area has a temperate alpine climate type, with an annual average temperature and an annual average precipitation of 0.5 °C and 350 mm, respectively61.

The vegetation type of the Sayram Lake watershed is relatively homogeneous, with the lake area making up 24.8% of the watershed area, and the rest of the land is mainly covered by high-, medium- and low-coverage grassland (59.6%), woodland (5.2%), bare rock (8.7%), and Gobi (1.5%) compositions (Fig. 1). The tourist area of Sayram Lake is popular among tourists, with an average daily admission of nearly 7,000 visitors. The G30 highway runs along the southeast side of Sayram Lake through the basin to bring superior transportation conditions, while the construction of a 79 km-long tourist road around the lake promotes tourism.

Sample collection

To study the accumulation of HMs caused by human activities in Sayram Lake Basin, sampling sites were predefined in this study based on the traffic deployment in the Sayram Lake watershed and the activity areas of tourists in this tourist area. In total, 18 topsoil samples were collected from the study area, and topsoil samples were collected from 0–20 cm in October 2020, and GPS was used for geographic positioning. To prevent the topsoil samples from becoming cross-contaminated, the collected topsoil samples were immediately placed in polyethylene bags and sealed for storage. The stainless-steel drill bit was cleaned between samples.

Sample preparation and laboratory analysis

Soil samples collected in the field were naturally dried at room temperature in the laboratory. Then, impurities were removed from the soil samples, and the soil was crushed using an agate mortar and passed through a 200-mesh nylon sieve. The processed soil samples were sealed in polyethylene bags for further chemical analysis. The determination and analysis of HMs and whole-rock, organic carbon and Pb isotopes in the soil samples were performed at ALS Minerals-ALS Chemex (Guangzhou, China).

The soil samples (0.125 g) were digested in three stages using HCl-HNO3-HF-HClO4. First, HNO3 and HClO4 were used for preoxidation. Then, HF was added, and the mixture was heated in an electric furnace. Finally, the remaining solution was diluted with HCl. The concentrations of Cr, Cu, Ni, Pb and Zn were determined by inductively coupled plasma atomic emission spectroscopy (ICP‒AES) (Agilent, 5110, USA). The concentrations of Cd were determined by inductively coupled plasma‒mass spectrometry (ICP‒MS) (Agilent, 7900, USA).

An X-ray fluorescence spectrometer (PANalytical, PW2424, Netherlands) was used to analyze the whole-rock composition of the soil, and the ME-XRF26 method provided by ALS Minerals-ALS Chemex (Guangzhou, China) was applied to determine the Al2O3, CaO, Fe2O3, K2O, MgO, MnO, Na2O, P2O5, SiO2, SO3 and TiO2 content percentages. The loss-on-ignition of soil samples was determined based on the OA-GRA05x method provided by ALS Minerals-ALS Chemex (Guangzhou, China) at 1000 °C (LOI1000). The detection limit for loss-on-ignition was 0.01%, and the relative deviation and relative error of the method were both less than 5%. A carbon–sulfur analyzer (LECO, CS844, USA) was used for soil organic carbon analysis. The soil samples that were digested by HCl were separated and filtered for organic carbon with a porous crucible. The crucible was cleaned with deionized water, dried and placed in an infrared induction furnace to quantitatively detect the percentage content of soil organic carbon. The detection limit for soil organic carbon was 0.02%, and the relative deviation and relative error of the method were less than 5% and 3.5%, respectively.

The Pb isotope ratios (208Pb/206Pb and 207Pb/206Pb) of the digested soil were determined by ICP‒MS (Agilent 7700x), and the instrument was calibrated using a reference material (SRM981-NIST, USA) and standard material (GBW04426, China) for quality control. The relative standard deviations (RSD) of 208Pb/206Pb and 207Pb/206Pb were less than 0.02 for multiple measurements of the reference material GBW04426.

Soil grain size was measured by a Bettersizer laser grain-size analyzer (BT-9300SE) manufactured in China, and the measurement accuracy error and repeatability error were both less than 1%. According to the Udden-Wentworth33 grain size classification standard, the soil grain size in this study included clay (< 4 μm), fine-silty (4–16 μm), silt (16–32 μm), coarse-silty (32–64 μm) and sandy (> 64 μm) grades.

Quality assurance and quality control

To ensure the accuracy of the experimental data, monitoring materials (blank samples, duplicate samples and standard materials (MRGeo08, GBW07179) were inserted in each testing batch for HM testing of soil samples. The detection limits for HMs (Cr, Cu, Ni, Zn, Pb and Cd) were 1 mg/kg, 0.2 mg/kg, 0.2 mg/kg, 2 mg/kg, 0.5 mg/kg and 0.02 mg/kg, respectively, and the analytical errors were less than 5%. The limits of detection for whole-rock compositions (Al2O3, CaO, Fe2O3, K2O, MgO, MnO, Na2O, P2O5, SiO2, SO3 and TiO2) were 0.01%, and the analytical errors were less than 5%.

Statistical analysis methods

In this study, R 4.0.4 software was used for descriptive statistics, descriptive data such as the mean, maximum and minimum values, and coefficient of variation (CV) were calculated, and Pearson correlation analysis was used for preliminary source analysis. Origin 2021b software was used to map the whole-rock composition and particle size characteristics, elemental concentration characteristics, ecological risk assessment results and comparative Pb isotopes of the soil samples. Canoco562 was applied to complete redundancy analysis between HMs and whole-rock elements and grain size.

Ecological risk assessment methods

(1) Geoaccumulation index (Igeo). To assess the degree of HM contamination in the topsoil of the tourist area of Sayram Lake, Igeo was used as an indicator for evaluation, and it was defined as follows63:

$${I}_{geo}={{\log}}_{2}\left(\frac{{C}_{i}}{1.5*{B}_{i}}\right)$$
(1)

where Ci represents the measured concentration value of element i, and Bi represents the local background value of element i. The background values of HMs in this study were referenced from the 1994 statistics of the China National Environmental Monitoring Centre64 in the Sayram Lake Basin (Cd: 0.12, Cr: 57.3, Cu: 20.7, Ni: 24.9, Pb: 18.5, Zn: 67.3, Unit: mg/kg). Igeo is divided into 7 categories65, indicating different levels of contamination, and the classification information is presented in Table S3.

(2) Pollution load index (PLI). The PLI method was originally developed by Tomlinson et al. and is now widely used for soil HM contamination assessment66. The calculation is as follows:

$$C{F}_{i}=\frac{{C}_{i}}{{B}_{i}}$$
(2)
$$PLI={\left(C{F}_{1}*C{F}_{2}*C{F}_{3}*\cdots *C{F}_{n}\right)}^\frac{1}{n}$$
(3)

where CFi represents the pollution index value of a single factor, Ci denotes the measured concentration of pollutant i and Bi refers to the background value of element i. A PLI less than 1 indicates no contamination, and a PLI greater than 1 and less than 2 indicates moderate contamination.

(3) Potential ecological risk index (PERI). The PERI method was originally developed by Hakanson67, it integrates the ecological and toxicological effects of HMs, and it is widely used for the assessment of the eco-risk and contamination levels of the soil environment by HMs68.

$${C}_{r}^{i}=\frac{{C}_{i}}{{B}_{i}}$$
(4)
$${E}_{r}^{i}={T}_{i}*{C}_{r}^{i}$$
(5)
$$PERI=\sum_{i=1}^{n}{E}_{r}^{i}$$
(6)

where Ci denotes the measured concentration of pollutant i, Bi denotes the background concentration of element i, Ci r denotes the enrichment coefficient of the element, Eir denotes the eco-risk of the individual pollutant, and Ti represents the toxicity coefficient, which was derived from a previously published study for Cd, Cr, Cu, Ni, Pb, and Zn (the values were 30, 2, 5, 5, 5, and 1, respectively)69. PERI is classified into 4 risk levels, ranging from low risk to significantly high risk (Table S4).

Positive matrix factorization (PMF)

The PMF model is a receptor model recommended by the US EPA for source analysis of environmental pollutants70,71, which was used in this study for source analysis of HMs in the tourist area of Sayram Lake. The basic equation of the PMF receptor model is as follows:

$${X}_{ij}=\sum_{k=1}^{p}\left({g}_{ik}{f}_{kj}+{e}_{ij}\right)$$
(7)

where Xij denotes the measured concentration of HM j in sample i, gik denotes the contribution of the kth source to the ith sample, fik represents the eigenvalue of contamination source k to the jth HM concentration, and eij is the residual matrix. The optimal matrix of g and f can be obtained by continuously minimizing the objective function Q. The calculation of Q is as follows:

$$Q={\sum_{i=1}^{n}\sum_{j=1}^{m}\left(\frac{{e}_{ij}}{{u}_{ij}}\right)}^{2}$$
(8)

where uij is the uncertainty, and its calculation depends on whether the concentration of HMs is lower than the method detection limit (MDL). When the concentration of HMs exceeds the MDL, uij can be calculated according to the following equation:

$${u}_{ij}=\sqrt{{\left(error\, fraction*c\right)}^{2}+MD{L}^{2}}$$
(9)

Otherwise, uij is calculated by the following equation:

$${u}_{ij}=5/6*MDL$$
(10)

where c is the concentration of the HMs, MDL denotes the method detection limit, and the error fraction is the percentage of measurement uncertainty.