Estimation of PM2.5 Concentrations in China Using a Spatial Back Propagation Neural Network

Wang, Weilin; Zhao, Suli; Jiao, Limin; Taylor, Michael; Zhang, Boen; Xu, Gang; Hou, Haobo

doi:10.1038/s41598-019-50177-1

Download PDF

Article
Open access
Published: 24 September 2019

Estimation of PM2.5 Concentrations in China Using a Spatial Back Propagation Neural Network

Weilin Wang^1,2,
Suli Zhao¹,
Limin Jiao^1,2,
Michael Taylor ORCID: orcid.org/0000-0002-3473-3478³,
Boen Zhang^1,2,
Gang Xu ORCID: orcid.org/0000-0001-5731-5036^1,2 &
…
Haobo Hou¹

Scientific Reports volume 9, Article number: 13788 (2019) Cite this article

5953 Accesses
51 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Methods for estimating the spatial distribution of PM_2.5 concentrations have been developed but have not yet been able to effectively include spatial correlation. We report on the development of a spatial back-propagation neural network (S-BPNN) model designed specifically to make such correlations implicit by incorporating a spatial lag variable (SLV) as a virtual input variable. The S-BPNN fits the nonlinear relationship between ground-based air quality monitoring station measurements of PM_2.5, satellite observations of aerosol optical depth, meteorological synoptic conditions data and emissions data that include auxiliary geographical parameters such as land use, normalized difference vegetation index, elevation, and population density. We trained and validated the S-BPNN for both yearly and seasonal mean PM_2.5 concentrations. In addition, principal components analysis was employed to reduce the dimensionality of the data and a grid of neural network models was run to optimize the model design. The S-BPNN was cross-validated against an analogous but SLV-free BPNN model using the coefficient of determination (R²) and root mean squared error (RMSE) as statistical measures of goodness of fit. The inclusion of the SLV led to demonstrably superior performance of the S-BPNN over the BPNN with R² values increasing from 0.80 to 0.89 and with the RMSE decreasing from 8.1 to 5.8 μg/m³. The yearly mean PM_2.5 concentration in China during the study period was found to be 41.8 μg/m³ and the model estimated spatial distribution was found to exceed Level 2 of the China Ambient Air Quality Standards (CAAQS) enacted in 2012 (>35 μg/m³) in more than 70% of the Chinese territory. The inclusion of spatial correlation upgrades the performance of conventional BPNN models and provides a more accurate estimation of PM_2.5 concentrations for air quality monitoring.

Global prediction of extreme floods in ungauged watersheds

Article Open access 20 March 2024

Disappearing cities on US coasts

Article Open access 06 March 2024

Heat health risk assessment in Philippine cities using remotely sensed data and social-ecological indicators

Article Open access 27 March 2020

Introduction

Long-term exposure to ambient fine particulate matter (PM) is associated with adverse human health conditions. PM_2.5 particles, with an aerodynamic diameter <2.5 μm, can be inhaled into the nasal passages and can carry toxic substances that are harmful to human health^1,2. Studies have shown that long-term exposure to high PM_2.5 concentrations can have serious impacts on human organs such as the liver, lungs and be responsible for the development of cardiovascular diseases^3,4,5,6. Therefore real time monitoring of PM_2.5 concentrations is extremely important for preventing pollution-related health issues as well as for the formulation of effective environmental protection measures.

Since 2013, many PM_2.5 monitoring stations across China have been established to measure air quality data. However, due to their sparse and uneven distribution, with most being located near cities, limitations still exist for effective and representative in situ monitoring of PM_2.5 concentrations at the regional scale^7,8,9. Monitoring capability can be increased by fusing satellite remote sensing data such as the aerosol optical depth (AOD) with PM_2.5 measurements and help in the construction of space-time models of PM_2.5 concentrations within or across regions. In addition, auxiliary datasets such as meteorological data, source emissions data, land use data, topographic data and socio-economic data, can also been used to reinforce the relationship between PM_2.5 concentrations and various observed variables^7,9,10.

Existing models for predicting PM_2.5 concentrations can be classified into two categories: deterministic models and statistical models^9,11. Deterministic models, including large-scale air quality simulations^12,13,14, model physical processes such as emission, dispersion, transformation, and diffusion, as well as the chemical reactions occurring in polluted air^11,15,16. However, since most models require sophisticated prior knowledge of pollutant diffusion states and chemical reaction pathways, deterministic PM_2.5 concentration estimation is complex and computationally expensive¹⁷. Statistical models, while not necessarily simpler in design, are however able to achieve an almost equivalent level of PM_2.5 concentration prediction accuracy¹⁸ and due to their greater speed, have been extensively developed and deployed for monitoring purposes. Linear statistical models including simple linear regression models¹⁹, multiple linear regression (MLR) models²⁰, empirical models^21,22, and geo-weighted regression models^8,10, have been able to obtain satisfactory results. However, the functional relationship between the PM_2.5 concentration and explanatory variables is nonlinear. As a result, many nonlinear statistical models have been used to estimate PM_2.5 concentrations including support vector regression, generalized additive models⁸, artificial neural network (ANN) models^{9,11,18,23,24} and more recently, deep learning methods^{9,11,18,23,24}. With improvements in computational capacity, models have also gradually incorporated more exogenous variables such as meteorological factors (e.g., relative humidity, temperature, and wind speed), land use factors, topographic data, source emissions data and socio-economic data^{9,11,18,23,24}.

Despite all these significant advances, most models have ignored the influence of the geographical distance between PM_2.5 monitoring stations as well as Tobler’s First Law of Geography²⁵ - that everything is related to everything else but that nearby things are more related than distant things. Furthermore, various studies have demonstrated that the distribution of PM_2.5 concentrations shows significant spatial auto-correlation. In this study, we aim to exploit this additional information by developing a spatial back propagation neural network (S-BPNN) that can improve the accuracy of PM_2.5 concentration estimation by explicitly including a spatial lag variable (SLV). The performance of the S-BPNN model is compared with that of a conventional back propagation neural network (BPNN) that does not include the SLV. We then use the S-BPNN to map the yearly and seasonal mean distribution of PM_2.5 concentrations across China for the study period, and assess exceedances.

Data and Methods

Data fusion

In order to construct a S-BPNN multivariate model, ground-level PM_2.5 concentration measurements were fused with satellite aerosol optical depth data, meteorological synoptic conditions data and source emissions data at 1280 monitoring sites in China, to form a large and spatially-diverse sample dataset of seasonal and yearly mean values. Table S1 presents the sources, units and spatial scales of each variable. Arcpy in ESRI’s ArcGIS was used to perform spatial interpolation for meteorological data as part of the preparation of the sample data.

Ground-level PM_2.5 measurements

Hourly PM_2.5 concentrations at 1280 stationary sites in 190 cities from 2015-01-01 to 2015-12-31 were collected from the official database of the China National Environmental Monitoring Centre (CNEMC: http://www.cnemc.cn/en/). PM_2.5 concentrations were measured via the tapered element oscillating microbalance method (TEOM) and then averaged at each site to produce time series of daily mean PM_2.5. Seasonal (spring: MAM, summer: JJA, autumn: SON, winter: DJF) and yearly mean PM_2.5 concentrations were then also calculated from the daily mean PM_2.5 for all monitoring stations. Figure 1 shows the spatial distribution of these PM_2.5 monitoring sites in China and the yearly mean value for 2015.

MODIS AOD satellite data

The AOD at 550 nm from the moderate resolution imaging spectraradiometer (MODIS) has been found to weakly correlate with PM_2.5 concentrations^11,26,27. Nonetheless, this is an important explanatory variable used to drive satellite remote sensing models of PM. We obtained AOD data from the MODIS Terra and Aqua Collection 6.1 via the NASA Level-1 and Atmospheric Archive and Distribution System (https://ladsweb.modaps.eosdis.nasa.gov). The AOD data has a maximum spatial resolution of 10 km and covers the study period from 2015-01-01 to 2015-12-31. The 10 km AOD products were retrieved using the Dark Target (DT) algorithm and the MODIS Conversion Toolkit (MCTK). For each grid cell, where Terra MODIS AOD (MOD04) data was available, we estimated missing Aqua MODIS AOD (MYD04) data by linear interpolation to extract values at the centre of the pixel. The same estimation procedure was used to for MOD04 when only MYD04 was available. MOD04 and MYD04 data was averaged where both products were available. Daily AOD products were then averaged to produce seasonal and yearly AOD values.

Synoptic conditions data

Meteorological data was then obtained from the Meteorological Data Sharing Service System in China (http://data.cma.cn/en) and includes wind speed (WS, m/s), relative humidity (RH, %), surface pressure (PRS, Pa), temperature (TEM, °C), precipitation (PRE, mm), and sunshine duration (SSD, h). There are 839 meteorological monitoring stations providing a total of 306,235 records during the study period. To obtain seasonal means and yearly means, we averaged the seasonal and yearly mean WS, RH, PRE, TEM, PRE, and SSD values calculated from the daily meteorological data across the monitoring network. Yearly mean distribution maps of each meteorological variable in the study area were then spatially interpolated with Arcpy in ESRI’s ArcGIS software by calculating the seasonal and yearly mean meteorological conditions at the monitoring stations with the inverse distance weighted (IDW) using a grid size of 10 km × 10 km. Values are extracted from the centre of 10 km grid.

PM_2.5 emissions data

Land use is a major contributor to the source apportionment of PM_2.5 pollution⁷. Land use data for 2015 having 30 metre resolution was obtained from the Geographical Information Monitoring Cloud Platform (http://www.dsac.cn/) and categorised into built-up areas, arable land, forest, water bodies, and bare land. We also downloaded NDVI and population density data having spatial resolution 1 km × 1 km from the Resource and Environment Data Cloud Platform (http://www.resdc.cn/) and LandScan (https://web.ornl.gov/sci/landscan/) respectively. In both cases, we extracted the pixel value closest to each PM_2.5 monitoring station. To account for the contribution of traffic emission pollution sources to the PM_2.5 concentration⁷, main roads within 10 km of the PM_2.5 monitoring sites were included using road network data downloaded from OpenStreetMap (www.openstreetmap.org/). To account for industrial sources of pollution, we considered the number and distribution of state monitoring enterprises for exhaust gas emissions as being indicative of the number and distribution of industrial pollution sources. The basic information (including addresses) of the exhaust gas monitoring enterprises in 2015 (totaling 3206) was obtained from the Ministry of Ecology and Environment of the People’s Republic of China (http://www.mee.gov.cn/). The Baidu Geocoding API (http://lbsyun.baidu.com/index.php?title=webapi/guide/webservice-geocoding) was then used to obtain the geographical location (latitude and longitude) of each enterprise. The number of emission enterprises within a radius of 10 km of the PM_2.5 monitoring stations was used as a measure of industrial emissions. Finally, digital elevation model (DEM) data was derived from the Geospatial Data Cloud (http://www.gscloud.cn/) and the pixel value containing or nearest to each PM_2.5 station was extracted.

Figure S1 presents the histograms and associated median statistics for the set of variables in the fused sample data set spanning the period 2015-01-01 to 2015-12-31 calculated from the 1280 PM_2.5 ground-based monitoring sites in China. The median value of the PM_2.5 concentration is 52.0 μg/m³. The distributions of AOD at 550 nm, WS, temperature, SSD, NDVI and construction land area exhibit some similarity to the distribution of PM_2.5 concentrations. In contrast, other variables including RH, pressure, precipitation, population density, road length, DEM base height, and the number of industrial pollution companies have distributions that are either multimodal or power law in nature. While these variables aren’t expected to strongly co-vary with the PM_2.5 concentration, recent studies have shown that they can nevertheless still have an influence on the spatial distribution of PM_2.5 concentrations^7,9,18,26. Consequently, we performed principal components analysis (PCA) on the full set of parameters, retaining those components that account for >98% of the total variance in line with recent approaches^24,28. As a result, the dimensionality of the was reduced from 15 variables (AOD, latitude, longitude, RH, WS, temperature, pressure, precipitation, NDVI, DEM, population density, number of pollution companies, road length, construction land area plus the SLV calculated from localised PM_2.5 concentrations described in the next section) to 11 principal components.

Spatial autoregression

In order to develop a continuous model for the distribution of PM_2.5 concentrations over China as from local measurements made at air quality network monitoring stations together with regionally-distributed independent variables, we construct a spatial autoregressive (SAR) model. SAR models are a class of statistical models that apply to observations over a continuous spatial domain typically made at local nodes of a network or vertices of a uniform or non-uniform grid. Importantly, they allow the effect of spatial correlation to be included explicitly. SAR extends conventional multiple linear regression by allowing outcomes in one area to be affected by outcomes in nearby areas (i.e. spatial lags on the dependent variable), by covariates from nearby areas (i.e. spatial lags on independent variables), and by spatially autoregressive errors from nearby areas. The general form for a first-order SAR model is given by^29,30:

$$\begin{array}{c}{\boldsymbol{y}}={\bf{X}}\beta +\rho {{\bf{W}}}_{1}{\boldsymbol{y}}+{\boldsymbol{u}}\\ {\boldsymbol{u}}=\lambda {{\bf{W}}}_{2}{\boldsymbol{u}}+{\boldsymbol{\varepsilon }}\\ {\boldsymbol{\varepsilon }}\sim N(0,{\sigma }^{2}{{\boldsymbol{I}}}_{n})\end{array}$$

(1)

where y is a [n × 1] scalar vector of observations of the dependent variable (PM_2.5 concentrations in our case), X is a [n × k] matrix of exogenous variables with [k × 1] regression coefficients β, W₁y, is the [n × 1] spatial lag variable (SLV) calculated from the weighted average of nearby PM_2.5 monitoring sites (see next section) with spatial autoregression parameter ρ, u is the error term expressed in terms of [n × 1] spatially-lagged errors W₁u, with spatial autoregression coefficient λ and ε which is a [n × 1] scalar vector of normally distributed (iid) random errors. W₁ and W₂ are the [n × 1] spatial weights matrices and in geophysical applications, are usually equivalent with the general property that all of their diagonal elements are zero and their rows sum to one. Equation (1) is sufficiently general that it embraces two major classes of spatial statistics models.

For ρ = 0, Eq. (1) reduces to the spatial error model (SEM) whereby the spatial dependence of y is correlated with spatial autoregression in the errors and measures the influence of errors in the exogenous variables in the local neighbourhood of observations of the dependent variable:

$${\boldsymbol{y}}={\bf{X}}\beta +{({\bf{I}}-\lambda {{\bf{W}}}_{2})}^{-1}{\boldsymbol{\varepsilon }}$$

(2)

For λ = 0, Eq. (1) reduces to the spatial lag model (SLM) whereby y is spatially-autocorrelated and models the diffusion of y over a region:

$${\boldsymbol{y}}={({\bf{I}}-\rho \lambda {{\bf{W}}}_{1})}^{-1}{\bf{X}}\beta +{({\bf{I}}-\rho \lambda {{\bf{W}}}_{1})}^{-1}{\boldsymbol{\varepsilon }}$$

(3)

The SLM therefore incorporates a spatial multiplier (I − ρW₁)⁻¹ called the “Leontief inverse” which connects the dependent variable y to all exogenous variables x_i in the system at location I and not just the error at location i. This observed behaviour has been reported in several studies that have shown that the distribution of PM_2.5 concentrations show significant spatial autocorrelation^11,26. For a specific grid, the SLV is given by:

$$SLV=\frac{\mathop{\sum }\limits_{i=1}^{n}w{s}_{i}P{M}_{2.5,i}}{\mathop{\sum }\limits_{i=1}^{n}w{s}_{i}}$$

(4)

where, n is the number of nearby PM_2.5 concentration measurements, ws_i = 1/ds_i is the spatial weight for the i^th nearby PM_2.5 concentration and ds_i is its spatial distance. SAR modeling with Arcpy in ESRI’s ArcGIS suggests that n = 3 is optimal for our sample data set. In order to construct an integrated model that not only reflects the local autocorrelation of the PM_2.5 concentrations but also expresses the nonlinear relationship between PM_2.5 concentration and independent variables, we explicitly incorporate the SLV as a virtual variable and construct a S-BPNN.

Spatial back-propagation neural network (S-BPNN)

Previous studies confirm that ANN models perform more efficiently than MLR models for PM air pollution monitoring and forecasting^9,18,31 due to their increased capacity for modeling the nonlinear relation between exogenous variables and PM_2.5 concentrations. A conventional BPNN with three layers (an input layer, a hidden layer and an output layer) was constructed in our study as a control model as it has the desired property that it can act as a universal function approximator (UFA)^32,33,34 and provide reliable baseline results. We then investigate the effect of including spatial autoregression by incorporating the SLV as an additional virtual variable in the set of exogenous inputs. We refer to the resulting model as a S-BPNN model of PM_2.5 concentration. As described in the section on Data Fusion, PCA was applied to the list of 15 exogenous parameters that include the SLV and the resulting 11 principal components, accounting for >98% of the total variance, were used as explanatory variables in the input layer. In accordance with the requirements of a UFA³⁵ the hidden layer comprises neurons having a nonlinear activation function. The output layer is a single linear neuron providing the PM_2.5 concentration. A schematic diagram of the S-BPNN is illustrated in Fig. 2.

Optimization of the S-BPNN model structure

When optimising the design of feed-forward back-propagation neural networks with supervised learning, training accuracy is expected to increase with the addition more neurons added to a hidden layer. However, this does not then necessarily translate into a corresponding improvement in overall model accuracy when tested on ‘unseen’ validation data. Apart from the obvious increase in training time needed, the problem is over-fitting of the neural network to the training data; resulting in a loss of ability to generalise on new data; significantly and negatively impacting overall model validity and performance^24,32. Whether a top-down approach to selecting the number of neurons with network weight pruning or a bottom-up convergence approach is adopted, the goal for setting the number of nodes in the hidden layers of feed-forward neural networks is to use as few nodes as necessary to meet the accuracy requirements of the model. Studies have shown that approximate bounds on the numbers of nodes in the hidden layer for most applications range from $2\sqrt{{\rm{n}}}+{\rm{\mu }}$ to2n + 1^9,36, where n and μ are the numbers of nodes in the input layer and output layer, respectively. With a single output and 11 inputs from PCA, this suggests that the optimal number of nodes in the hidden layer of the S-BPNN required to maximise performance ought to be in the range 7 to 23. Following previous studies^9,24,33, we applied a sensitivity analysis approach to the neural network architecture by running a grid of models varying the number of nodes in the hidden layer from 1 to 33 and monitoring the model accuracy. Each neuron in the hidden layer had a nonlinear logistic activation function $f(x)=\frac{1}{1+{e}^{-x}}$ and for weight and bias optimisation, MATLAB’s “traingdx” back-propagation algorithm was adopted which performs gradient descent with momentum (0.9) and an adaptive learning rate (0.05) using the mean squared error (MSE) as a cost function.

Figure S2 shows the training and validation goodness of fit statistics for the grid of runs used to optimise the architecture of the S-BPNN. In the evaluation of each run, 10-fold cross-validation was applied to the sample data divided 70%: 30% into a training (model fitting) dataset containing 896 records and a test (validation) dataset containing 384 records. The statistical indicators used to measure the goodness of fit are the coefficient of determination (R²), the root-mean-square error (RMSE, μg/m³), the mean prediction error (MPE, μg/m³), and relative prediction error (RPE, %) which are defined in the Supplementary Information accompanying this paper. As expected, we observed that, as the number of neurons in the hidden layer increases, S-BPNN model performance slightly improves for the training (fitting) data but gradually degrades for the test (validation) data. Over-fitting is observed when the number of neurons > 7 (the point beyond which the R² value continues to degrade from its first local maximum). As such, 7 hidden neurons were adopted as the optimal case. We also investigated the effect of changing the back-propagation training algorithm to the Levenberg-Marquardt algorithm and using the hyperbolic tangent (tanh) nonlinear activation function, but neither led to any improvement in the performance.

Results and Discussion

Performance of the models

After determining the optimal neural network architecture, we then partitioned the sample dataset comprising N = 1280 records (one for each monitoring station) including yearly and seasonal mean data, and trained and tested the S-BPNN and BPNN. A common framework was used for both models, with the exception that the S-BPNN includes the SLV in the principal components fed as inputs to the network. MATLAB’s Neural Network Toolbox version 6.0 was then used to build and train the S-BPNN and the BPNN. The accuracy of the trained models was calculated using 10-fold cross-validation. During this procedure, the sample data set was randomly divided into 10 parts; 9 of which were used for fitting and 1 for validation each time. The mean accuracy of 10 cross-validation trials for the networks trained on yearly mean data is shown in Table 1.

Table 1 Accuracy of the trained S-BPNN and BPNN models calculated with 10-fold cross-validation applied to yearly mean data.

Full size table

Inclusion of the SLV in the S-BPNN leads to an increase in performance with the mean R² of the fitting and validation data increasing from 0.80 and 0.75 respectively for the BPNN to 0.89 for the S-BPNN. A corresponding reduction in all mean error measures is observed. The RMSE of the S-BPNN never exceed 7.45 μg/m³. To further evaluate the performance of S-BPNN and BPNN models, scatter plots of the fitting and validation results are shown in Fig. 3.

To assess the performance on the seasonal timescale, the same approach was adopted and applied to seasonal mean datasets to train and evaluate S-BPNN and BPNN models. The mean accuracy of 10 cross-validation trials for the networks trained on the seasonal mean data are presented in Table 2. Some variation in model performance with season is apparent with lowest errors in the summer and largest errors in the winter seasons. This is to be expected for two reasons. Firstly, cloud cover is seasonal and impacts the quality of satellite AOD retrievals. While uncertainty on the model inputs was not explicitly included in the model design in this study, the propagation of uncertainty through spatial nonlinear models warrants future attention. Secondly, the sample data is constructed from daily averaging which leads to variation in the uncertainty of the sample data used to train the models at this timescale. This variation is expected to impact the quality assessment of models as their prediction timescale (yearly → seasonal) approaches the daily timescale. Nevertheless, it is clear that the S-BPNN outperforms the BPNN in all cases and that the incorporation of spatial information is of clear benefit even at shorter modelling timescales.

Table 2 Accuracy of the trained S-BPNN and BPNN models calculated with 10-fold cross-validation applied to seasonal mean data.

Full size table

Spatial distribution of PM_2.5 concentration

In Fig. 4, the S-BPNN model was used to map the seasonal and yearly mean distribution of PM_2.5 concentrations in China at a spatial resolution of 10 km. The distributions of seasonal and annual PM_2.5 concentrations have considerable spatial heterogeneity and spatial aggregation. This has two major impacts. Firstly, the sparse monitoring of large expanses of south-western China leads to significant gaps in model input data in this region. Secondly, strong clustering of monitoring stations in regions of high urbanisation means that spatial data in these regions is likely to be more representative. The S-BPNN modeled yearly mean PM_2.5 concentration for China in 2015 was found to be 41.76 μg/m³ and reflects well the value of 52.0 μg/m³ calculated at the nodes of the air quality monitoring network. Importantly, the interpolative power of the S-BPNN model at interstitial locations allows for a moderate resolution assessment of national exceedances nationwide. We find that more than 70% of Chinese territory exceeds Level 2 of the Ambient Air Quality Standards (CAAQS) having a yearly mean concentration > 35 μg/m³.

Overall, the levels of PM_2.5 concentrations are higher in the northern regions than in the southern regions. Heavily polluted regions are located in the North China Plain, especially Beijing-Tianjin-Hebei (BTH), and south-western Xinjiang, where the highest annual PM_2.5 concentration reached 108 μg/m³. However, the causes for such high levels of PM_2.5 are different in these locations. Pollution in the North China Plain’s is caused mainly by industrial emissions and is exacerbated by stagnant weather, with a weak wind and a relatively low boundary layer height reducing the dispersion, transformation and diffusion of atmospheric gases and chemical reactions²⁶. South-western Xinjiang’s pollution is due to desert dust particles which make a significant contribution to the accumulation of PM_2.5 concentrations⁷. While lower level regions of PM_2.5 concentrations are found in the south provinces (e.g., Hainan, Guangdong, Fujian and Yunnan), these regions benefit from low levels of industrial source emissions and favourable meteorological conditions for gas dispersion and chemical reaction in the atmosphere. While it is immediately apparent that winter is the most polluted season with high levels of PM_2.5 concentrations and summer is the cleanest with the lowest levels, some regions also exhibit high levels of PM_2.5 concentrations in the spring especially in the North China Plain and over Northwest China. PM_2.5 pollution is mitigated to some extent in the autumn months.

Conclusions

Faced with the complexity involved in modelling PM_2.5 concentrations deterministically, neural network-driven statistical models like BPNNs have been developed with demonstrable advantages for the estimation and mapping of PM_2.5 concentrations and other particulate matter components of air pollution^{9,24,37,38,39,40}. By incorporating spatial correlation information using a SLV in an S-BPNN model design, we were able to improve the accuracy and performance of a BPNN trained to retrieve PM_2.5 concentrations at 10 km resolution from satellite AOD, meteorological data, land use data, source emission data and related geographical data. The exogenous variables used encompass not only static features that impact PM_2.5 concentrations, but also the dynamic processes at work at the local and regional scale.

Cross-validation results suggest that the S-BPNN model outperforms conventional BPNN models and provides accurate estimates of yearly mean PM_2.5 concentrations and exceedances for China to a precision of RMSE < 7.45 μg/m³. Similarly reliable estimates were also obtained for seasonal means over China, recovering understood patterns in the geographical distribution of pollution sources across the country both in terms of geography and also in terms of synoptic conditions. Sensitivity analysis applied to neural network design using a grid of runs in combination with 10-fold cross-validation enabled model performance to be optimised in a systematic way, and ensured that the models produced are robust and reproducible.

Despite the satisfactory performance of the S-BPNN model, the SLV only accounts for the spatial distance between PM_2.5 monitoring stations. It is possible that a further increase in performance can be achieved by extending the SAR model to include also the spatial lag of covariates as well as spatially autoregressive errors both within and between spatial domains. In particular, it is expected that the inclusion of and propagation of uncertainty information on the variates will to help models capture higher frequency variability PM concentrations. It is also not yet clear what the balance is between increased SAR model complexity and nonlinearity in terms of the efficiency and performance of statistical models of PM_2.5 concentrations, i.e. whether or not a linear framework could be adequate. In a future study we will adapt the SLM to MLR models of PM and consider other approaches such as deep learning to help identify spatial features and model trends in the data. For this, we will exploit the availability of high-resolution satellite AOD products to map at even higher resolution greater detail, to estimate the distribution of PM_2.5 concentrations.

Data Availability

The research data sets used in this work are available upon request from Limin Jiao (lmjiao@whu.edu.cn). Access to monitoring data is permitted subject to the consent of the respective observatories owning the source instruments, and according to their internal policies for data administration. Please refer the author list for contact details.

References

Nel, A. Air pollution-related illness: effects of particles. Science. 308, 804–806 (2005).
Article CAS Google Scholar
Zhang, Y.-L. & Cao, F. Fine particulate matter (PM_2.5) in China at a city level. Sci Rep. 5, 14884 (2015).
Article ADS CAS Google Scholar
Kampa, M. & Castanas, E. Human health effects of air pollution. Env. Poll. 151, 362–367 (2008).
Article CAS Google Scholar
Lelieveld, J., Evans, J. S., Fnais, M., Giannadaki, D. & Pozzer, A. The contribution of outdoor air pollution sources to premature mortality on a global scale. Nature. 525, 367–371 (2015).
Article ADS CAS Google Scholar
Madrigano, J. et al. Long-term exposure to PM_2.5 and incidence of acute myocardial infarction. Env. Health Persp. 121, 192–196 (2013).
Article Google Scholar
Zhang, Q. et al. Transboundary health impacts of transported global air pollution and international trade. Nature. 543, 705–709 (2017).
Article ADS CAS Google Scholar
Fang, X., Zou, B., Liu, X., Sternberg, T. & Zhai, L. Satellite-based ground PM_2.5 estimation using timely structure adaptive modeling. Remote Sens. Env. 186, 152–163 (2016).
Article ADS Google Scholar
Zou, B., Chen, J., Zhai, L., Fang, X. & Zheng, Z. Satellite Based Mapping of Ground PM_2.5 Concentration Using Generalized Additive Modeling. Remote Sens. Basel. 9, 1 (2016).
Article ADS Google Scholar
Li, T., Shen, H., Zeng, C., Yuan, Q. & Zhang, L. Point-surface fusion of station measurements and satellite observations for mapping PM_2.5 distribution in China: Methods and assessment. Atmos. Env. 152, 477–489 (2017).
Article CAS Google Scholar
He, Q. & Huang, B. Satellite-based mapping of daily high-resolution ground PM_2.5 in China via space-time regression modeling. Remote Sens. Env. 206, 72–83 (2018).
Article ADS Google Scholar
Li, X. et al. Long short-term memory neural network for air pollutant concentration predictions: Method development and evaluation. Env. Poll. 231, 997–1004 (2017).
Article CAS Google Scholar
Chen, J. et al. Seasonal modeling of PM_2.5 in California’s San Joaquin Valley. Atmos. Env. 92, 182–190 (2014).
Article CAS Google Scholar
Saide, P. E. et al. Forecasting urban PM₁₀ and PM_2.5 pollution episodes in very stable nocturnal conditions and complex terrain using WRF–Chem CO tracer model. Atmos. Env. 45, 2769–2780 (2011).
Article CAS Google Scholar
Wang, Z., Maeda, T., Hayashi, M., Hsiao, L. F. & Liu, K. Y. A Nested Air Quality Prediction Modeling System for Urban and Regional Scales: Application for High-Ozone Episode in Taiwan. Water Air & Soil Poll. 130, 391–396 (2001).
Article ADS Google Scholar
Geng, G. et al. Estimating long-term PM_2.5 concentrations in China using satellite-based aerosol optical depth and a chemical transport model. Remote Sens. Env. 166, 262–270 (2015).
Article ADS Google Scholar
Donkelaar, A. V. et al. Global estimates of ambient fine particulate matter concentrations from satellite-based aerosol optical depth: development and application. Env. Health Perspect. 118, 847–855 (2010).
Article Google Scholar
Stern, R. et al. A model inter-comparison study focussing on episodes with elevated PM₁₀ concentrations. Atmos. Env. 42, 4567–4588 (2008).
Article CAS Google Scholar
Gupta, P. & Christopher, S. A. Particulate matter air quality assessment using integrated surface, satellite, and meteorological products: 2. A neural network approach. J. Geophys. Res. Atmos. 114 (2009).
Wang, Z., Chen, L., Tao, J., Zhang, Y. & Su, L. Satellite-based estimation of regional particulate matter (PM) in Beijing using vertical-and-RH correcting method. Remote Sens. Env. 114, 50–63 (2010).
Article ADS Google Scholar
Li, C., Hsu, N. C. & Tsay, S. C. A study on the potential applications of satellite data in air quality monitoring and forecasting. Atmos. Env. 45, 3663–3675 (2011).
Article CAS Google Scholar
Liu, Y., Sarnat, J. A., Kilaru, V., Jacob, D. J. & Koutrakis, P. Estimating ground-level PM_2.5 in the eastern United States using satellite remote sensing. Env. Sci. Tech. 39, 3269–3278 (2005).
Article CAS Google Scholar
Lv, B., Cobourn, W. G. & Bai, Y. Development of nonlinear empirical models to forecast daily PM_2.5 and ozone levels in three large Chinese cities. Atmos. Env. 147, 209–223 (2016).
Article CAS Google Scholar
Zou, B. et al. Spatial modeling of PM_2.5 concentrations with a multifactoral radial basis function neural network. Env. Sci. & Poll. Res. 22, 10395–10404 (2015).
Article CAS Google Scholar
Taylor, M., Retalis, A. & Flocas, H. A. Particulate Matter Estimation from Photochemistry: A Modelling Approach Using Neural Networks and Synoptic Clustering. Aerosol & Air Qual. Res. 16, 2067–2084 (2016).
Article CAS Google Scholar
Tobler, W. R. A Computer Movie Simulating Urban Growth in the Detroit Region. Econ. Geog. 46, 234–240 (1970).
Article Google Scholar
Li, T., Shen, H., Yuan, Q., Zhang, X. & Zhang, L. Estimating Ground-Level PM_2.5 by Fusing Satellite and Station Observations: A Geo-Intelligent Deep Learning Approach. Geophys. Res. Lett. 44(11), 911–985 993 (2017).
Google Scholar
Wu, J. et al. Applying land use regression model to estimate spatial variation of PM_2.5 in Beijing, China. Env. Sci. & Poll. Res. 22, 7045–7061 (2014).
Article Google Scholar
Hinton, G. S. R. S. Reducing the dimensionality of data with neural networks. Science. 313, 504–507 (2006).
Article ADS MathSciNet CAS Google Scholar
Chen, X. & Dai, E. Comparison of spatial autoregressive models on multi-scale land use. Trans. Chinese Soc. Agric. Eng. 27, 324–331 (2011).
Google Scholar
Anselin, L. Spatial econometrics:methods and models. (Dordrecht, Kluwer Academic Publishers, 1988).
Perez, P. & Reyes, J. An integrated neural network model for PM₁₀ forecasting. Atmos. Env. 40, 2845–2851 (2006).
Article CAS Google Scholar
Mao, X., Shen, T. & Feng, X. Prediction of hourly ground-level PM_2.5 concentrations 3 days in advance using neural networks with satellite data in eastern China. Atmos. Poll. Res. 8, 1005–1015 (2017).
Article Google Scholar
Wu, Y. et al. Synergy of satellite and ground based observations in estimation of particulate matter in eastern China. Sci. Tot. Env. 433, 20–30 (2012).
Article CAS Google Scholar
Yao, L. & Lu, N. Spatiotemporal distribution and short-term trends of particulate matter concentration over China, 2006–2010. Env. Sci. & Poll. Res. 21, 9665–9675 (2014).
Article CAS Google Scholar
Hornik, K., Stinchcombe, M. & White, H. Multilayer feedforward networks are universal approximators. Neur. Net. 2, 359–366 (1989).
Article Google Scholar
Reich, S. L., Gomez, D. R. & Dawidowski, L. E. Artificial neural network for the identification of unknown air pollution sources. Atmos. Env. 33, 3045–3052 (1999).
Article CAS Google Scholar
Xu Gang, J. L. M. Y. Spatial and Temporal Variability of the PM_2.5/PM₁₀ Ratio in Wuhan, Central China. Aerosol & Air Qual. Res. 17, 1–11 (2017).
Article Google Scholar
Xu, G., Jiao, L., Zhao, S. & Cheng, J. Spatial and temporal variability of PM_2.5 concentration in China. Wuhan University Journal of Natural Sciences. 21, 358–368 (2016).
Article CAS Google Scholar
Xu, G. et al. Examining the Impacts of Land Use on Air Quality from a Spatio-Temporal Perspective in Wuhan, China. Atmos. Basel. 7, 62 (2016).
ADS Google Scholar
Lv, B., Cai, J., Xu, B. & Bai, Y. Understanding the Rising Phase of the PM_2.5 Concentration Evolution in Large China Cities. Sci. Rep. 7 (2017).

Download references

Acknowledgements

This work was supported by the National Key R&D Program of China (Grant Number 2017YFA0604404) and National Natural Science Foundation of China (No. 41571385).

Author information

Authors and Affiliations

School of Resource and Environmental Sciences, Wuhan University, 129 Luoyu Road, Wuhan, 430079, China
Weilin Wang, Suli Zhao, Limin Jiao, Boen Zhang, Gang Xu & Haobo Hou
Key Laboratory of Geographic Information System, Ministry of Education, Wuhan University, 129 Luoyu Road, Wuhan, 430079, China
Weilin Wang, Limin Jiao, Boen Zhang & Gang Xu
Department of Meteorology, University of Reading, Reading, RG6 6BB, UK
Michael Taylor

Authors

Weilin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Suli Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Limin Jiao
View author publications
You can also search for this author in PubMed Google Scholar
Michael Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Boen Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Gang Xu
View author publications
You can also search for this author in PubMed Google Scholar
Haobo Hou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.J. and W.W. had the original idea and designed the study, S.Z., G.X., B.Z. and H.H. performed data analysis and interpretation, L.J., W.W. and M.T. wrote the manuscript, and, G.X., B.Z., H.H. and S.Z. contributed to data processing and the manuscript revision. All authors reviewed the manuscript.

Corresponding author

Correspondence to Limin Jiao.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Estimation of PM2.5 Concentrations in China Using a Spatial Back Propagation Neural Network

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, W., Zhao, S., Jiao, L. et al. Estimation of PM2.5 Concentrations in China Using a Spatial Back Propagation Neural Network. Sci Rep 9, 13788 (2019). https://doi.org/10.1038/s41598-019-50177-1

Download citation

Received: 17 April 2018
Accepted: 06 September 2019
Published: 24 September 2019
DOI: https://doi.org/10.1038/s41598-019-50177-1

This article is cited by

A novel hybrid prediction model for PM2.5 concentration based on decomposition ensemble and error correction
- Hong Yang
- Junlin Zhao
- Guohui Li
Environmental Science and Pollution Research (2023)
Data-driven predictive modeling of PM2.5 concentrations using machine learning and deep learning techniques: a case study of Delhi, India
- Adil Masood
- Kafeel Ahmad
Environmental Monitoring and Assessment (2023)
A new hybrid prediction model of PM2.5 concentration based on secondary decomposition and optimized extreme learning machine
- Hong Yang
- Junlin Zhao
- Guohui Li
Environmental Science and Pollution Research (2022)
A Hybrid Atmospheric Satellite Image-Processing Method for Dust and Horizontal Visibility Detection through Feature Extraction and Machine Learning Techniques
- Mehdi Amiri
- Seyfollah Soleimani
Journal of the Indian Society of Remote Sensing (2022)
Methodological notes on pandemic virus SARS-CoV-2 research
- Gianluigi Zangari del Balzo
Theory in Biosciences (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Data and Methods

Data fusion

Ground-level PM2.5 measurements

MODIS AOD satellite data

Synoptic conditions data

PM2.5 emissions data

Spatial autoregression

Spatial back-propagation neural network (S-BPNN)

Optimization of the S-BPNN model structure

Results and Discussion

Performance of the models

Spatial distribution of PM2.5 concentration

Conclusions

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links

Ground-level PM_2.5 measurements

PM_2.5 emissions data

Spatial distribution of PM_2.5 concentration