Introduction

Haze is present in most urban regions of the world and is a global environmental problem. The rapid increase in the urbanization and industrialization of many cities in China has caused haze pollution, which is dominated by the enhanced concentration of PM2.5 (particulate matter with an aerodynamic diameter ≤ 2.5 μm). PM2.5 has significant adverse impacts on public health, the environment, and climate because of its small diameter, high activity, and ability to transport noxious substances in the air with long residence times1,2,3. Epidemiological studies have confirmed that PM2.5 can induce various respiratory and cardiovascular diseases, impair the body’s immune system, and increase the mortality of people exposed to it4,5. More than 1.3 million people in China die prematurely every year due to prolonged exposure to polluted air, which is approximately 40% of the global total6. The International Agency for Research on Cancer (IARC) classified PM2.5 as a human carcinogen in 20137. Moreover, continuous haze damages the overall image of a city and weakens the attraction of tourism, talent, and investments, which restricts the sustainable development of the economy8,9,10. The issues related to cleaning the air have aroused attention both domestically and globally.

PM2.5 has attracted significant attention from the atmospheric and climate science community, and many studies have explored environmental issues related to it, including spatiotemporal distribution and drivers. Research on the spatial patterns of PM2.5 mainly focuses on environmental coping policies by identifying its distribution patterns and spatial effects. These studies reveal that PM2.5 has regional, cumulative, and compound effects and possesses significant spatiotemporal variability11,12,13. In addition, PM2.5 is not restricted to the local environment and can be diffused or transferred to neighboring areas largely through external forces, such as atmospheric circulation causing spillover effects14. The identification and estimation of PM2.5 with spatial characteristics should be studied by means of spatial analytical methods rather than by traditional statistical theory based on independent observations15. Spatial statistics started in the 1970s with the goal of understanding spatial dependence, spatial association, and other relations among data related to geographical locations with wide applications in many fields. Accurate identification of PM2.5 determinants can provide a strong theoretical basis for the prevention and control of air pollution. Therefore, many scholars are committed to understanding source analyses, emission inventories, chemical conversion, and regional transport of PM2.5 and have achieved significant results. In general, PM2.5 is considered to be emitted from local sources, including primary sources, such as traffic fumes, industrial activities, soil dust, biomass, and coal combustion, and secondary sources of gaseous pollutants (such as SO2, NOx, and NH3) formed by complex chemical reactions10,16,17,18. Thus, a local PM2.5 concentration change is the result of the combined action of natural and human factors19,20. Natural conditions, such as meteorology, topography, and vegetation, play important roles in the generation, accumulation, transfer, diffusion, and settlement of PM2.5 and profoundly affect its local concentration20,21. With respect to the influence of socioeconomic factors related to anthropogenic activities on the distribution of PM2.5, studies suggest that energy-intensive economic growth and nonecological urbanization increase PM2.5 concentrations, implying that economic development, urbanization, industrialization, land use, and energy use mixtures and efficiency can affect urban air quality13,22,23,24. These studies provide many insights into urban particle pollution from the perspectives of artificial and natural conditions.

The Yangtze River Economic Belt (YREB) is one of China’s most crucial economic and ecological corridors that connects three national urban agglomerations of the Sichuan–Chongqing, the middle reaches of the Yangtze River, and the Yangtze River Delta (YRD). The YREB accounts for more than 40% of China’s population and GDP, and its unique geographical advantages and vast economic hinterland make it the region with the greatest economic growth potential in the next 30 years25. However, urban air quality along the Yangtze has been deteriorating and has suffered from varying degrees of haze pollution due to continuous and intense industrial and human activities26. Recently, some areas in the middle-lower reaches of the Yangtze River have had more than 100 haze d year−1, with some cities even exceeding 200 d year−127. In the upper reaches of the Yangtze River, the ecological environment is relatively vulnerable as economic development is scaled up28. The alleviation of haze pollution involves the integrity of the ecological and environmental system and the quality of local people’s lives9. In 2014, the development of the YREB was given national strategy status, leading to the fact that potential conflicts between economic growth and environmental conservation may be more prominent than ever before29. In 2018, President Xi emphasized the protection of the eco-environmental status of the YREB. Hence, the eco-environmental conservation of the Yangtze River is a priority for the government of China25.

Until now, studies on the environmental problems of the YREB have mainly focused on the macro-level, including environmental quality and risk assessment, a low-carbon economy, and a sustainable development strategy9,28. Studies on environmental problems at the micro-level focus more on water pollution, while studies on air pollution are still insufficient27. As the China National Environmental Monitoring Centre (CNEMC) did not include PM2.5 in its monitoring index system until 2012, relatively few studies focused on PM2.5 in the YREB prior to 201230. The CNEMC has begun widespread real-time monitoring of PM2.5 since 2013. Furthermore, the use of remote sensing data for long-term time series research was limited to local areas and specific cities, and studies on factors of influence were mostly in terms of the overall perspective31. Obvious spatial differences in the natural conditions and socioeconomic development of various regions exist, and hence, a global analysis cannot reveal the spatial heterogeneity of the effects of various factors32. On the other hand, the majority of research has not conducted spatial analysis from a geographical viewpoint. Although some research has considered the spatial dependence of PM2.5 pollution, it has still neglected the spatial differentiation of the PM2.5 level at the urban agglomeration scale and lacked analysis of the interactive mechanism between factors, which may have led to incomplete and biased conclusions33.

In this study, we attempt to fill the aforementioned knowledge gaps. We consider 130 cities within the YREB as the research area and employ geostatistical and spatial analytical methods to summarize the spatiotemporal evolutionary features of PM2.5. Furthermore, we adopt a geographically weighted regression model and geographical detectors to quantitatively analyze the spatial differentiation and interactive effects of socioeconomic and natural conditions on PM2.5 during various periods. Thus, our study is significant in terms of meeting the demand for air quality improvements in the YREB and sheds light on implementing effective urban PM2.5 pollution abatement policies.

Data and methods

Study area

The YREB spans the eastern, central, and western regions of China, including nine provinces (Zhejiang, Jiangsu, Jiangxi, Anhui, Hubei, Hunan, Sichuan, Yunnan, and Guizhou) and two municipalities (Shanghai and Chongqing) (Fig. 1). The YREB comprises an area of 2.05 × 106 km2, accounting for 21.3% of the country’s land area. Moreover, its economy and population density are 6.2 and 4.5 times the national average, respectively, establishing its incomparable role in China’s development strategies8. However, the YREB is an ecologically vulnerable zone, and hence, assessing PM2.5 pollution in the region is of great strategic significance for national ecological security and regional sustainable development28.

Figure 1
figure 1

General overview of the YREB. Standard map services are provided by the Ministry of Natural Resources of China (http://bzdt.ch.mnr.gov.cn/), GS (2020)4619.

Data acquisition and processing

Our data sources have four components: (1) PM2.5 concentration data from satellite retrievals. The annual concentrations of PM2.5 were obtained from the global surface raster with a resolution of 0.01° × 0.01°, which was released by the Atmospheric Composition Analysis Group (ACAG) (https://sites.wustl.edu/acag/). This PM2.5 remote sensing inversion dataset has the largest global coverage, the longest time span, and the highest accuracy, and it has been widely verified and effectively applied in China34. To verify the accuracy of the dataset in the YREB, we calculated the annual concentration of each station through real-time monitoring data of the YREB from 2015 to 2017 obtained from the CNEMC (http://www.cnemc.cn) and correlated it with the PM2.5 concentration from the remote sensing inversion data at the corresponding position. The correlation coefficient was 0.82 and significant, indicating a strong correlation and consistency between these datasets. (2) Data on natural parameters include wind, precipitation, vegetation, and topography21,35. The normalized differential vegetation index (NDVI) is the best indicator of vegetation coverage and growth status, and the original data were derived from the National Aeronautics and Space Administration (NASA) (https://www.nasa.gov/), while other supporting data were obtained from the Resource and Environment Data Cloud Platform (http://www.resdc.cn). (3) Socioeconomic data includes per capita GDP, population density, economic density, urbanization rate, industrial structure, and energy consumption33,36. Studies have shown that there is a significant linear correlation between nighttime light data from the Defense Meteorological Satellite Program/Operational Line Scanner (DMSP/OLS) and energy consumption37. We converted the light intensity to gray pixel values and used the sum of all DMSP/OLS raster gray values in each region as an indicator of energy consumption in the region. The DMSP/OLS dataset was obtained from the National Oceanic and Atmospheric Administration (NOAA) (https://www.ngdc.noaa.gov), and the remaining data were collected from the China Urban Statistics Yearbook published by the National Bureau of Statistics (http://www.stats.gov.cn/). (4) Geographic information data includes the spatial vector map, which was derived from the National Catalog Service for Geographic Information (http://www.webmap.cn).

As multicollinearity in the selected variables may cause information redundancy, we adopted the variance inflation factor (VIF) for multicollinearity diagnosis with a threshold of 5. We excluded the indicators of economic density and energy consumption because their VIFs were greater than 5. Thus, 8 explanatory variables were used in the model, including per capita GDP (pgdp), population density (popd), urbanization rate (urba), secondary industry share (indu), annual wind speed (wind), annual precipitation (prec), NDVI (ndvi), and topographic relief (topo). To reduce data heteroscedasticity, all the indicators were treated with Z–standardization.

Methods

Spatial autocorrelation analysis

Tobler’s first law of geography states that everything is related to everything else, but nearby things are more related than distant things15. Therefore, we adopted the classical spatial autocorrelation method to quantitatively measure the spatial dependence of PM2.5 in neighboring regions. Global Moran’s I is written as38:

$$I\frac{{n\sum\limits_{i = 1}^{n} {\sum\limits_{j = 1}^{n} {W_{ij} \left( {x_{i} - \overline{x}} \right)\left( {x_{j} - \overline{x}} \right)} } }}{{\sum\limits_{i = 1}^{n} {\sum\limits_{j = 1}^{n} {W_{ij} \sum\limits_{i = 1}^{n} {\left( {x_{i} - \overline{x}} \right)^{2} } } } }}$$
(1)

where xi and xj are the observations of spatial units i and j, respectively; \(\overline{x }\) is the mean of n locations; and Wij is a spatial weight matrix. I [− 1, 1], where I > 0 indicates positive correlation, I < 0 indicates negative correlation, and I = 0 indicates mutual independence.

Spatial variogram analysis

The spatial variogram is a geostatistical method to describe the structure and randomness of regionalized variables and is often used to effectively measure the spatial structure characteristics and variation pattern of geographical variables39. The variogram is expressed as:

$$\gamma \left( \lambda \right) = \frac{1}{2N\left( \lambda \right)}\sum\limits_{i = 1}^{N\left( \lambda \right)} {\left[ {\xi \left( {x_{i} } \right) - \xi \left( {x_{i} + \lambda } \right)} \right]^{2} }$$
(2)

where ξ(xi) and ξ(xi + λ) are the values of the regionalized variables at points xi and xi + λ, respectively, and N(λ) is the sample size with separation distance λ.

There are three major parameters derived from the variogram model39,40,41. The nugget parameter (C0) is a random spatial variance; the partial sill parameter (C) is a structural spatial variance; the sill parameter (C0 + C) represents the total degree of spatial variation. The nugget effect (C0/(C0 + C)) indicates whether regional or local factors are more important for PM2.5. If the nugget effect is less than 0.25, it indicates strong spatial correlation; if it is between 0.25 and 0.75, it shows moderate spatial correlation; and if it is greater than 0.75, it indicates weak spatial correlation41. The range parameter (A0) represents the maximum spatial distance of the correlation.

The fractal dimension is another important parameter that characterizes the variogram, and its value is determined by the relationship between the variogram γ(λ) and distance λ:

$$2\gamma \left( \lambda \right) = \lambda^{{\left( {4 - 2D} \right)}}$$
(3)

where D is the slope of the double logarithm linear regression equation; the higher the value is, the higher the heterogeneity caused by the spatial autocorrelation. The closer the value is to 2, the more balanced the spatial distribution.

Geographically weighted regression modeling (GWR)

Geographically weighted regression (GWR) is a spatial regression model based on the idea of local smoothness42. This technique constructs an independent equation for each unit in the study area and incorporates the spatial attributes of data into the regression model so that the relationship between variables can change with the change in spatial location, thus reflecting the spatial nonstationarity of parameters in different regions. Its formula is as follows16:

$$y_{i} = \varphi_{0} \left( {\omega_{i} ,\alpha_{i} } \right) + \sum\limits_{i = 1}^{p} {\varphi_{p} \left( {\omega_{i} ,\alpha_{i} } \right)x_{ip} + \varepsilon_{i} }$$
(4)

where xip is a dimensional interpretation variable matrix; (ωi, αi) represents the longitude and latitude coordinates at the ith observation point; φp(ωi, αi) is the regression coefficient of the ith observation point; and εi is a random error term.

Geographical detector technique

The geographical detector technique is a set of statistical methods to detect spatially stratified heterogeneity and reveal the driving forces behind it43. It can detect the possible causal relationship between variables by verifying the consistency of the spatial distribution of two variables44. The explanatory power of the factor is measured by the q value, and its expression is as follows:

$$q = 1 - \frac{{\sum\limits_{h = 1}^{L} {N_{h} \sigma_{h}^{2} } }}{{N\sigma^{2} }}$$
(5)

where L refers to the strata of variable Y (PM2.5) or factor X; Nh and \({\sigma }_{h}^{2}\) are the number of units and variance of strata h, respectively; N and σ2 are the total number of units and variance, respectively;\(q\, \in \,\left[ {0,{ 1}} \right]\) , and the greater the value is, the stronger the explanatory power of this factor is.

The purpose of interactive detection is to assess whether the factors X1 and X2 work together to increase or decrease the explanatory power on Y or whether the impact of these factors on Y is independent. The evaluation method first calculates the q value of the two factors X1 and X2 acting on Y, namely, q(X1) and q(X2), and then calculates the q value for their interaction, namely, q(X1 ∩ X2); finally, q(X1) and q(X2) are compared with q(X1 ∩ X2). The discriminant criteria can be divided into 5 categories (Table 1).

Table 1 Types of interaction between two covariates.

Results

Spatiotemporal evolutionary characteristics of PM2.5

During 2000–2017, the overall PM2.5 level in the YREB showed a trend of first increasing and then decreasing (Fig. 2a). In 2005–2010, the annual PM2.5 concentration in the YREB was higher than the second-level standard of China of 35 μg/m3. During 2000–2007, the PM2.5 level increased from 23.49 to 37.37 μg/m3, an increase of 59.07%. Afterward, it decreased to 31.79 μg/m3 in 2017, a decrease of 14.93%. This improvement may be due to the effects of the national tenth 5-year plan on controlling the total emissions of major pollutants, adjusting the industrial structure, and establishing a monitoring, statistics, and assessment system for energy conservation and pollution emission reduction. It was further reinforced by the implementation of the Action Plan for Air Pollution Prevention and Control in 2013 (http://www.mee.gov.cn).

Figure 2
figure 2

Variation in the PM2.5 concentration and Moran’s I value in the YREB.

Figure 2a shows that the sliding interval of the global Moran’s I value over the years was [0.825, 0.894], and all the values were significant at the 99% level. This indicates that the PM2.5 distribution was not random but was a significant spatial agglomeration. We pursued the association of the evolution rules of spatial correlation with distance and found that the distance threshold to maintain spatial correlation was approximately 870 km (Fig. 2b). Within this spatial range, PM2.5 had significant positive interaction effects, which increased with the shortening of distance.

Figure 3 displays the spatial patterns and evolution of PM2.5 in the YREB from 2000 to 2017. Its main features are as follows: (1) cities with an annual PM2.5 level of less than 15 μg/m3 were mainly concentrated in ethnic minority areas, where the environmental conditions were relatively good. However, air quality continuously deteriorated, albeit at low levels, which needs attention. (2) The PM2.5 level was higher in the lower reaches than in the upper reaches and higher on the north bank than on the south bank, presenting a diagonal spatial distribution pattern with an obvious lowland plain directivity. (3) Urban economic activities and population density were closely related to PM2.5 in three centers, namely, the Cheng-Yu area, the Wuhan metropolitan area, and the northern Anhui–Jiangsu region.

Figure 3
figure 3

Spatial patterns of PM2.5 in the YREB between 2000 and 2017.

Spatiotemporal variation characteristics of PM2.5

The value of the variogram increased with increasing separation distance, indicating that the spatial autocorrelation of PM2.5 changed from strong to weak with increasing distance (Table 2). During 2000–2017, the variation range was 625–738 km, and it showed an overall upward trend, implying that the spatial correlation of PM2.5 was partly expanded in scope. In addition, the nugget effect indicated that regional-scale factors are more important for the distribution of PM2.5.

Table 2 Fitting parameters of the PM2.5 concentration variogram.

In terms of the fractal dimension (Table 3), the isotropic dimension continuously decreased from 1.536 in 2000 to 1.453 in 2017, indicating that the spatial difference in PM2.5 was continuously expanding. The northeast-southwest direction had the greatest goodness of fit and the smallest fractal dimension and showed a downward trend. This result denoted that the spatial variation in PM2.5 in this direction was continuously strengthened, making it the main direction in terms of spatial difference. The southeast-northwest fractal dimension was the largest, and its decisive coefficient continued to decrease, showing that the spatial difference in PM2.5 in this direction continued to weaken and remained relatively evenly balanced. We conducted 3D-kriging interpolation, which further depicted the spatial distribution and evolution morphology of PM2.5 (Fig. 4). It was evident that the spatial pattern steadily transitioned from a gradient differentiation to a relatively balanced structure that formed a trend indicating that the middle-lower reaches of the Yangtze River drove the whole basin PM2.5 level to increase.

Table 3 Variable difference dimension of PM2.5 concentrations.
Figure 4
figure 4

Evolution of PM2.5 concentrations in the YREB based on variograms.

Analysis models of the factors of influence

GWR model results

The GWR model fitting results are shown in Table 4, in which the adjusted R2 values are above 0.9, indicating good fitting performance.

Table 4 Fitting results of the GWR model.

The regression coefficients of socioeconomic factors, such as per capita GDP, population density, urbanization rate, and industrial structure, were mainly positive. Among them, population density had the largest impact on PM2.5 and the most obvious spatial difference, followed by per capita GDP, while the coefficients of urbanization and industrial structure were relatively small. The regression coefficients of natural factors, such as wind, precipitation, vegetation, and topography, were distributed in positive and negative intervals, and the instability was striking in different years. Among them, the coefficient of the distribution interval for topographic relief was the longest, indicating that spatial heterogeneity was the largest.

Spatial heterogeneity of factors of influence

The coefficient of anthropogenic factors increased, signifying that extensive economic development and intensive human activities aggravated haze pollution. Existing studies have argued that economic growth strongly correlates with regional environmental pollution, but the relationship between GDP and PM2.5 is significantly different in various regions19. Per capita GDP had positive impacts on PM2.5 in the middle-upper reaches of the Yangtze River (Fig. 5a–c). This implied that PM2.5 in economically underdeveloped areas was more sensitive to economic development and that the economic growth of these regions came at the cost of the environment. Some cities in the YRD showed negative correlation effects, which indicated that the development planning in the above areas was relatively good. With technological progress, industrial upgrading and economic development, these areas were essentially coordinated with the surroundings.

Figure 5
figure 5

Spatial distribution of regression coefficients of the GWR model.

The coefficient of population density showed approximately the same spatial distribution at the three time nodes (Fig. 5d–f), all of which increased from the coastal areas to the inland areas, and among them, the east–west difference was obvious in 2017. This may be because of the increase in urban traffic flow and production, with a higher population density contributing to an increase in the local PM2.5 level. The control of air pollution in the middle-upper reaches of the Yangtze Basin was still weak, thereby making the impact of population density more significant. Some studies have held that an increase in population density may have an agglomeration effect in promoting regional technical progress, thus facilitating a reduction in the local PM2.5 level14. The technical advances brought by the agglomeration effect for the population size in the YREB were not significant. This may be related to the migration of people to large cities in recent years and the disordered nature of population mobility.

The coefficient of the urbanization rate was positive at the three time nodes. The proportion of positive values in 2007 was relatively large, and the positive values in 2017 marginally declined (Fig. 5g–i). During 2000–2007, areas with low urbanization rates in the Yangtze Basin were experiencing rapid urbanization, and the urban infrastructure industry developed rapidly9. A large quantity of building dust entered the atmosphere, aggravating urban PM2.5 pollution. However, areas with a high urbanization rate, such as the YRD, tended to mature, and a stagnant infrastructure industry was conducive to the reduction in emissions of fine particles.

The industrial structure had a negative impact on PM2.5 in the middle-lower Yangtze and Cheng–Yu areas, aggravating local air pollution (Fig. 5j–l). This is consistent with the findings of existing studies confirming that industrial activities were the main drivers of PM2.5 in most areas19. The impact of the industrial structure in the Chengdu–Chongqing areas was strong, which may be because of heavy industries, such as the energy, chemical, and machinery industries, with relatively high direct energy consumption and pollutant emissions25. Optimization of the industrial structure significantly affects the local PM2.5 level. The coefficient had a weak impact on the YRD because the local industrial structure was dominated by the service industry and was relatively stable; thus, there was limited scope for further optimization.

The coefficients of wind in the inland areas were mainly positive and decreased from the central region to the west (Fig. 5m–o). The negative impact was dominant in the eastern areas, and as the distance to the coast decreased, the negative impact increased. This may be related to the impacts of topography and monsoons27. Coastal areas were mostly alluvial plains with flat terrains, and they were affected by the local circulation caused by monsoon and temperature differences. Clean air from the ocean had important dilution effects on pollutants, and thus, the coefficient was mainly negative. In the Sichuan Basin, the impact of closed topography restricted the diffusion of airflow, and the transport of wind caused pollutants in the region to interact with each other; thus, the coefficient was mainly positive. Similar findings were made in the Fenwei Plain, where basin topography exists30.

The impact of precipitation on regional PM2.5 presented a negative correlation at the three time nodes (Fig. 5p–r). A positive influence was mainly distributed in the Sichuan Basin and some areas of the Yunnan–Guizhou Plateau in western China, while the negative effect decreased from the coastal to inland areas. Concretely, the regions with a high regression coefficient were mainly located in the upper and middle reaches of the Yangtze Basin, while the eastern coastal areas with abundant rainfall had a small regression coefficient, indicating that abundant precipitation had a positive impact on PM2.5 in most cities, and this effect was more distinct in areas with relatively deficient precipitation.

The impact of the NDVI on PM2.5 had a negative correlation that was mainly distributed in the middle-lower Yangtze Plains (Fig. 5s–u). Research has shown that vegetation growth is correlated with climate and is affected by topography and human activities, resulting in a complex correlation between PM2.5 and vegetation21. The high-value areas were mainly in Yunnan and Hunan Provinces, while the low-value areas were concentrated in the middle-lower Yangtze Plain. In 2017, the NDVI showed a negative impact, indicating that vegetation’s inhibitory effect on PM2.5 was distinctly enhanced. This may be related to the rapid restoration and development of vegetation in the Yangtze Basin, whose forest coverage rate rose from 24% in 2000 to 39% in 20179.

Topographic relief was negatively correlated with regional PM2.5 in the Yangtze Basin, which was conducive to the improvement in air quality (Fig. 5v–x). Specifically, the high-value regions were located in Sichuan, Chongqing, and Zhejiang. Lower values were concentrated in the middle-lower reaches of the Yangtze River. Local topography affects the diffusion and dilution of atmospheric pollutants in an area by influencing meteorological conditions20.

Interactions between factors

Table 5 shows the results of the interactions between factors. The impact of the interactions between factors was greater than that for individuals, and the interaction effects included nonlinear enhancement and bifactor enhancement. When wind speed, precipitation, and vegetation interacted with PM2.5 in pairs, the nonlinear enhancement effect was generated at all three time nodes, and the explanatory power was notably varied in different periods. When the industrial structure interacted with the per capita GDP and urbanization level on PM2.5, a nonlinear enhancement effect was exerted at all three time nodes, and the explanatory power was continuously improved. Nonlinear enhancement means that the interactive impact of two factors is greater than the sum of the impacts when they act alone. The interactive types of pgdp ∩ popd, pgdp ∩ urba, wind ∩ topo, prec ∩ topo, and ndvi ∩ topo were dominated by nonlinear enhancement, although they varied at different times. The types of popd ∩ urba and popd ∩ indu exerted a bifactor enhancement effect, which was not as significant as that of the nonlinear enhancement.

Table 5 Results of interaction detecting.

Conclusions and policy implications

Conclusions

Haze pollution in Chinese cities has escalated to hazardous levels in recent years. This environmental problem has become a great challenge for public health and urban sustainable development. Our exploration of PM2.5 in the YREB provides useful results for haze prediction, which is an important step toward protecting people from health damage caused by poor air quality. The results indicate that the annual PM2.5 level of the YREB displayed an upward trend before 2007 and a fluctuating downward trend thereafter. PM2.5 had significant spatial heterogeneity and convergence characteristics. There were clear spatiotemporal differences in the impact of various factors on the PM2.5 pollution pattern. In the socioeconomic layer, population has the greatest impact, followed by the economy and industry, while urbanization was a relatively stable factor causing the rise in the PM2.5 level. Among the natural factors, topography and vegetation mainly exerted a negative impact on PM2.5, while the potency and direction of others changed with spatiotemporal changes.

Policy implications

As the Yangtze River plays a vital role in China’s eco-environmental systems, stricter measures should be implemented to meet the goal of sustainable development. First, special attention should be given to natural factors when distributing industries and residences; for instance, topography and wind, which notably affect PM2.5, should be considered, while urban air ducts and green belt designs must be optimized. Second, region-targeted policies should be considered based on spatial differentiation. Downstream areas should play a leading role in promoting pollution prevention and control technologies, while upstream areas should actively drive ecological protection. More emphasis should be placed on transregional linkage governance when formulating mitigation measures. Third, heavily and highly polluted industrial sectors must be closed or upgraded, as they are major sources of particulates, while sectors with environmental protection, new energy, and a low-carbon economy and technology should be highly encouraged, such as building hi-tech eco-industrial parks. Last, it seems inevitable that air pollution will be aggravated because of burgeoning urbanization, and therefore, reducing the impact of anthropogenic activity on the atmosphere through environmental policy and education will reduce PM2.5 pollution.

Limitations and recommendation

Our study mainly focuses on the exploration of the heterogeneity and determinants of PM2.5 at the macro-level. However, haze reduction ultimately needs to be implemented at micro-level enterprises, so investigations of micro-enterprises should be strengthened. In addition, PM2.5 is closely related to other air pollutants, which leads to compound air pollution with multiple pollutants. Therefore, it is necessary to explore the mechanism of compound pollution in the future to better formulate control measures.