Modeling fine-grained spatio-temporal pollution maps with low-cost sensors

Iyer, Shiva R.; Balashankar, Ananth; Aeberhard, William H.; Bhattacharyya, Sujoy; Rusconi, Giuditta; Jose, Lejo; Soans, Nita; Sudarshan, Anant; Pande, Rohini; Subramanian, Lakshminarayanan

doi:10.1038/s41612-022-00293-z

Article
Open access
Published: 12 October 2022

Shiva R. Iyer¹,
Ananth Balashankar¹,
William H. Aeberhard²,
Sujoy Bhattacharyya^3,4,
Giuditta Rusconi^4,5,
Lejo Jose⁶,
Nita Soans⁶,
Anant Sudarshan⁷,
Rohini Pande⁸ &
…
Lakshminarayanan Subramanian ORCID: orcid.org/0000-0001-8101-1243¹

npj Climate and Atmospheric Science volume 5, Article number: 76 (2022)

Abstract

The use of air quality monitoring networks to inform urban policies is critical especially where urban populations are exposed to unprecedented levels of air pollution. High costs, however, limit city governments’ ability to deploy reference grade air quality monitors at scale; for instance, only 33 reference grade monitors are available for the entire territory of Delhi, India, spanning 1500 sq km with 15 million residents. In this paper, we describe a high-precision spatio-temporal prediction model that can be used to derive fine-grained pollution maps. We utilize two years of data from a low-cost monitoring network of 28 custom-designed low-cost portable air quality sensors covering a dense region of Delhi. The model uses a combination of message-passing recurrent neural networks combined with conventional spatio-temporal geostatistics models to achieve high predictive accuracy in the face of high data variability and intermittent data availability from low-cost sensors (due to sensor faults, network, and power issues). Using data from reference grade monitors for validation, our spatio-temporal pollution model can make predictions within 1-hour time-windows at 9.4, 10.5, and 9.6% Mean Absolute Percentage Error (MAPE) over our low-cost monitors, reference grade monitors, and the combined monitoring network respectively. These accurate fine-grained pollution sensing maps provide a way forward to build citizen-driven low-cost monitoring systems that detect hazardous urban air quality at fine-grained granularities.

Introduction

Pollution prediction in cities with dense populations can be critical for generating fine-grained policy recommendations and public health warnings^1,2,3. The scale of accurate sensor-based monitoring required to achieve this can come at a huge cost and thus inhibit the building of a dense fine-grained pollution sensing map. The deployment of low-cost particulate matter sensors to replace or augment reference grade air quality monitoring systems has been studied extensively in recent years, and this work has addressed issues of calibration^4,5,6, design^7,8, data selection⁹, and personal exposure quantification^10,11. However, building a highly accurate, large-scale, fine-grained pollution sensing and monitoring map that leverages the size of a pollution network has been largely unexplored. Specifically, modeling the behavior of noisy low-cost sensors in cities with high pollution and population density has not been studied previously, with recent state-of-the-art mapping approaches reporting errors in the range of 30–40%^12,13. This high error renders the pollution sensing map unusable for policymaking and air quality hazard detection. Prior work on deploying low-cost sensor networks for air pollution has been successful on a small scale (within a 2 km radius), with high rates of agreement for PM_2.5 measurements in the Southeastern United States¹⁴. Survey studies have shown that a paradigm shift towards crowd-funded sensor networks is necessary to enable fine-grained sensing-based applications on a large scale¹⁵. Calibration in such large-scale settings has been explored recently with promising results, without the need for significant recalibration¹⁶ after well-controlled laboratory calibration¹⁷.

PM_2.5 prediction models have recently explored deep neural networks such as long short-term memory (LSTM) networks, convolutional neural networks (CNN), and attention-based models, as well as vector regression and partial differential equations, but they focus on a single unified model at a single location rather than on a large-scale sensor network setting^{18,19,20,21,22,23,24}.

Recent work has also explored the use of distributed sensor networks to gather information on air pollution and other meteorological variables in urban contexts^{25,26,27,28,29}. Clements et al.³⁰ provide a comprehensive review of many such works. Researchers have sought to learn more about how pollution sensing systems built from low-cost sensors may be deployed in urban contexts^{14,31,32,33,34,35,36}. With the exception of Gao et al.³⁶, who examine the performance of fine particulate sensors in Xi’an, China, most of these deployments have occurred in areas with significantly lower air pollution than Delhi, India. Gao et al.³⁶ also point out that low-cost PM_2.5 sensors may perform worse in very low pollution environments, suggesting that they may be relatively more useful when particulate concentrations are high. Related approaches in this space can be broadly classified into three groups: spatial interpolation approaches, land-use regression, and dispersion models (Xie et al.³⁷, Jerrett et al.³⁸). Dispersion models assume that an appropriate chemical transport model has been identified, along with its parameter values and a high-quality emissions inventory. For land-use regression models, access to environmental characteristics that significantly influence pollution is critical. This additional data is often better suited to longer-range predictions, as the geographical and meteorological data vary over longer temporal and coarser spatial grids^39,40.

In this paper, we describe a methodology to model and predict urban air quality at a fine-grained level using dense and noisy low-cost sensors. There are two main questions we seek to answer in this paper: (i) how can we use a network of low-cost and portable air quality monitors to build a fine-grained pollution heatmap of a city that provides accurate predictions, and (ii) does it help to augment the existing monitoring networks of local governments with low-cost air quality sensors?

We deploy a network of 28 low-cost sensors, many of them concentrated in the south Delhi area, in collaboration with Kaiterra⁴¹, a company that makes low-cost air quality monitors and air filters. We dramatically increase the density of the deployment by 28× in Delhi (area 573 mi²) with 28 sensors, compared to previous deployments (Xi’an, area 3898 mi², 8 low-cost sensors). Further, the large longitudinal dataset we have been able to capture over 2 years, as compared to prior work that captured at most a few weeks of data, allows us to model long-term seasonal changes and train more complex neural network models that can adapt to seasonal and daily patterns. We build on prior work and model the pollution network in its entirety, with prediction models at each sensor location using data from nearby sensor locations.

We model pollution at any location in Delhi as the concentration of fine particulate matter (PM_2.5), measured in μgm⁻³, using historical data of up to 8 h from all the sensors in the network. We choose to build a fine-grained pollution sensing map over shorter timelines to leverage the primary advantage of low-cost sensors while overcoming the drawback of noise by aggregating numerous spatio-temporal measurements. By learning the variability of each of these noisy measurements through message-passing recurrent neural networks (MPRNN), which have the ability to model each sensor separately, we learn not only to separate the signal from the noise, but also to build an accurate sensing network of low-cost sensors that achieves <10% root mean squared error (RMSE) in predicting up to one hour in advance over a fine-grained spatio-temporal grid, as compared to baseline modeling approaches that provide 30% RMSE. By using a sparse network of sensors, whose signals are shared through neural network embeddings, we learn to capture the information from nearby sources that might affect the readings of nearby sensors (e.g., a factory) and to ignore the ones that are heavily localized (e.g., a food cart). Such an accurate, fine-grained pollution sensing map (≤10% MAPE) is usable by policymakers in deciding which neighborhoods of the city need interventions to improve air quality and population health. To the best of our knowledge, we are the first to attempt to model a city-scale sensor network deployment with low-cost sensors augmenting high-quality government monitoring stations. With a sensor network the size of a city, with 60 sensors spread across the city of Delhi (700 sq km), capturing spatio-temporal variations and constructing accurate pollution maps necessitates modeling each sensor separately. By increasing the scale and addressing the corresponding modeling challenges, our work has widespread implications for pollution sensing and its low-cost deployability.

Results

Our data consists of PM_2.5 concentration data averaged to the hour from the 28 low-cost sensors and the 32 government monitors, a total of 60 monitors, collected over a period of 24 months, from May 1, 2018, to May 1, 2020. We use the data until Oct 30, 2019 for training (75%) and hold out the remainder (25%) for testing. We report two criteria: the RMSE and the mean absolute percentage error (MAPE). We evaluate our models on the data from the combined set of our 28 low-cost sensors and the 32 government monitors, as well as separately on each set. For each of these locations, we compare our model-based predictions with the ground truth measured by the pollution sensor.
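The chronological hold-out described above can be sketched as follows. This is a minimal illustration, assuming hourly records stored as `(timestamp, sensor_id, pm25)` tuples and a split boundary of Nov 1, 2019 (the start of the test period given in the caption of Fig. 1); the record layout, the function name `chronological_split`, and the sample values are hypothetical and are not taken from the authors' code.

```python
from datetime import datetime

# Assumed boundary: the test period in Fig. 1 starts on Nov 1, 2019.
SPLIT = datetime(2019, 11, 1)

def chronological_split(records, split=SPLIT):
    """Split (timestamp, sensor_id, pm25) records by time, never at random."""
    train = [r for r in records if r[0] < split]
    test = [r for r in records if r[0] >= split]
    return train, test

# Hypothetical hourly records, for illustration only.
records = [
    (datetime(2018, 5, 1, 0), "sensor_a", 80.0),
    (datetime(2019, 10, 31, 23), "sensor_a", 150.0),
    (datetime(2019, 11, 1, 0), "sensor_a", 160.0),
    (datetime(2020, 4, 30, 23), "sensor_b", 45.0),
]
train, test = chronological_split(records)
print(len(train), len(test))
```

Splitting by time rather than by random sampling keeps every test hour strictly later than every training hour, which matches the forecasting setting.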

Overall, the MPRNN model with data imputed using STHM, along with the spline correction, provides a highly accurate estimate of the PM concentration level across all locations (see Table 1). The best performing model is able to predict PM_2.5 concentrations with an average RMSE of 10.1 μgm⁻³ and MAPE of 9.6% across all the locations and over the testing period. While estimating a spline per location provides the best predictive performance, we note that using an average spline across all observed locations only marginally increases the RMSE and MAPE. The average spline is computed after averaging the data over all the locations. Across all locations, the median RMSE and MAPE are 9.15 μgm⁻³ and 8.64% respectively (see Fig. 1). The best case values are 4.28 μgm⁻³ and 5.57% respectively, and the worst case values are 24.1 μgm⁻³ and 19.64% respectively. The minimum MAPE is obtained at a location in Green Park, a very busy area of south Delhi, further validating the need for fine-grained pollution sensing in a large city like Delhi.

Table 1 RMSE and MAPE of prediction of PM concentrations, averaged across all the sensor locations.

**Fig. 1: Prediction errors of PM_2.5 during the test period (Nov 1, 2019–May 1, 2020) shown as the mean absolute percentage error (MAPE) of the ground truth and predicted PM_2.5 concentration.**

Spatial variations

The 3-way cubic spline fit shows a common trend of baseline pollution rising steadily up to 8 am, then decreasing up to 4 pm, and then increasing again until midnight. We note that this is the composite polynomial model of the PM concentrations in an average day (see Fig. 2). The median error of this model is about 40 μgm⁻³ in each of the three windows, 12 am–8 am, 8 am–4 pm and 4 pm–12 am, and this is reduced to about 10 μgm⁻³ after the neural network model is fit on the residuals. Figure 2 and Supplementary Fig. 2 show the per-sensor splines and the average spline in detail. Not only do the per-sensor splines vary widely across space, but we also notice that the sensors with significantly high spline residual errors, such as A838, E8E4, and 2E9C in Supplementary Fig. 2, are all located in central locations of Delhi with well-established commercial activity: Connaught Place, Safdarjung Enclave and Lado Sarai, respectively. Further, in Supplementary Fig. 2, the outliers with significantly high residual error splines among the government monitoring stations are Patparganj DPCC, Punjabi Bagh DPCC, and DKSSR DPCC. While Patparganj is situated next to an industrial area, Punjabi Bagh is a well-known residential locality with established commercial activity centers, and DKSSR, short for Dr. Karni Singh Shooting Range, is a shooting range located on the outskirts of Delhi next to an interstate highway. The diversity of these splines across various geographical regions further indicates the need to model fine-grained pollution profiles in seemingly remote as well as central locations of Delhi. We also note that the average spline is sufficient for bootstrapping at locations where we do not have enough sensor data to begin with.

**Fig. 2: The interpretation of the spline correction, and its effect on the residual.**

For the most part, locations that exhibited high residual errors after the MPRNN fit continued to show high error (relative to other locations) even after spline correction, even though the magnitude of the residual decreases. This phenomenon is partially explained by the high baseline values of the sensors with high residual errors, which are often coupled with high variance in measurement.

Effect of network size and training data

The more monitors we used in our hybrid model, the better the final prediction performance. As Supplementary Fig. 3 shows, with only one monitor in the network, the predictive errors are about 35 and 20 μgm⁻³, respectively, for the low-cost sensor network and the government network. However, as we include data from more nodes in the network, the final prediction error drops sharply to about 15% and then gradually tails off at about 10%. The error flattens out at about 30 sensors, which is approximately the number of sensors of each type that we have in our experiment. We infer that an even denser deployment likely adds little value to the predictive performance. Further, decreasing the amount of training data shows that at minimum one year of data is required to capture the seasonal trends and achieve an RMSE of almost 10% (Supplementary Table 3).

Discussion

The low MAPE and RMSE across all monitors in Delhi provided by our Per-Sensor Spline+MPRNN with STHM imputation model are significant, as they mean that our model can detect hazardous air quality with high precision. The RMSE is significantly lower than the observed variance in PM_2.5 concentrations in a day, making the model useful for short-term and intraday analyses as well. The WHO air quality standards prescribe that PM_2.5 levels should not exceed 5 and 15 μgm⁻³ at the annual and daily average levels, respectively, while the Indian Government air quality standards prescribe 40 and 60 μgm⁻³, respectively. We note that, for the 60 sensors, Delhi exceeded these prescribed levels on 371 out of the 641 days at the daily level, across the 2 years of our measurement. The 9.6% MAPE that we are able to achieve corresponds to the ability to detect hazardous air quality as per Indian government standards with 93.5% precision and 90.8% recall. This further indicates that the low error rate we have obtained leads to an almost exact prediction of hazardous air quality. This enables citizen-driven sensing where pollution sensor readings can be crowdsourced, and effective policy interventions, like clean energy policies that penalize construction sites that have PM_2.5 levels more than 25% higher than the nearest monitoring center, can be operationalized⁴². Specifically, the improvement in predictive power is achieved in specific pollution hotspots like bus stations, markets, etc. (Fig. 1). In addition, we can provide transparency of the overall average pollution of the city⁴³ and contribute towards increasing the co-benefits of clean energy policies^44,45.
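To illustrate how a prediction error translates into hazard-detection precision and recall, the sketch below thresholds observed and predicted concentrations and counts agreements. It assumes the Indian daily standard of 60 μgm⁻³ quoted above as the hazard threshold; the text does not state exactly how the reported precision and recall were computed, so the threshold choice, the function name `hazard_precision_recall`, and the sample values are assumptions for illustration only.

```python
# Assumed threshold: the Indian daily PM2.5 standard quoted in the text.
HAZARD_THRESHOLD = 60.0  # micrograms per cubic metre

def hazard_precision_recall(observed, predicted, threshold=HAZARD_THRESHOLD):
    """Treat 'concentration above threshold' as the positive class."""
    tp = fp = fn = 0
    for obs, pred in zip(observed, predicted):
        obs_hazard = obs > threshold
        pred_hazard = pred > threshold
        if pred_hazard and obs_hazard:
            tp += 1  # hazard correctly flagged
        elif pred_hazard:
            fp += 1  # false alarm
        elif obs_hazard:
            fn += 1  # missed hazard
    precision = tp / (tp + fp) if (tp + fp) else float("nan")
    recall = tp / (tp + fn) if (tp + fn) else float("nan")
    return precision, recall

# Hypothetical daily averages, for illustration only.
observed = [120.0, 58.0, 75.0, 40.0, 65.0]
predicted = [110.0, 63.0, 70.0, 42.0, 57.0]
precision, recall = hazard_precision_recall(observed, predicted)
print(precision, recall)
```

Misclassifications concentrate on days whose true concentration lies close to the threshold, which is why a small percentage error can still yield high precision and recall when most days are far above or below the standard.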

Calibration

Since the data used to measure the model performance is new, it is important to understand the spatial variations and heterogeneity in measurements that underlie the sensor network. To further ensure that the improvement in the model’s prediction performance exceeds the noise in the data, we performed extensive calibration of the sensors. For this, we leveraged the calibration performed in-house by the sensor manufacturer (Kaiterra⁴⁶) (more info in Appendix), which confirms that re-calibration is not required⁴⁷, and also performed validation by comparing our sensor readings with the readings provided by the nearest government pollution monitoring station. Supplementary Figure 5 shows the cross-calibration of the average pollution value reported by the 28 government monitors with the average value of the 18 sensors in our testbed in the locality of South Delhi. We observe that the sensors are fairly well calibrated against the reference monitors and report a similar average value across the city despite individual sensor-level and spatio-temporal variations. This provides confidence that the data generated from this pilot is useful as a reference for pollution modeling and forecasting.

Further, we also performed a nearest neighbor calibration, where we compute the temporal correlation of each of our sensors with its nearest government monitoring station. Supplementary Table 4 shows that on average the correlation coefficients are >0.8, which indicates that there is no statistically significant difference between them on average (t-test, confidence level: 0.05, p-value: 0.0011). Further, in Supplementary Fig. 4, we see that when we order our sensors by the nearest neighboring government station, the cross-correlations between our sensors are correspondingly aligned, with high correlation between nearby sensors and low correlation between sensors that are farther apart. This further emphasizes the importance of the improvement in modeling, as it significantly improves the prediction capabilities of a fine-grained sensor network that can capture spatial variations in the pollution of Delhi.
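The nearest neighbor comparison can be sketched as follows: find the closest government station to a sensor, then compute the Pearson correlation of the two time series. This is an illustrative sketch only; it assumes planar coordinates in kilometres and time-aligned hourly series with no gaps, and the station names, coordinates, and readings are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length series."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)

def nearest_station(sensor_xy, stations):
    """Name of the station with the smallest Euclidean distance to the sensor."""
    return min(stations, key=lambda name: math.dist(sensor_xy, stations[name]["xy"]))

# Hypothetical stations and sensor, for illustration only.
stations = {
    "station_a": {"xy": (0.0, 0.0), "pm25": [100.0, 120.0, 90.0, 150.0]},
    "station_b": {"xy": (10.0, 10.0), "pm25": [60.0, 55.0, 70.0, 65.0]},
}
sensor_xy = (1.0, 1.0)
sensor_pm25 = [105.0, 118.0, 95.0, 140.0]

closest = nearest_station(sensor_xy, stations)
r = pearson(sensor_pm25, stations[closest]["pm25"])
print(closest, round(r, 3))
```

With real data the two series would first need to be restricted to the hours in which both the sensor and the station reported a value.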

The development of fine-grained pollution sensing maps at low cost can further catalyze the deployment of such monitoring networks in other polluted cities where the pollution networks are sparse. With citizens procuring, deploying, and modeling the pollution of cities accurately, this paper provides a way forward for developing high-quality fine-grained pollution sensing maps.

Methods

Summary

We model the spatio-temporal prediction problem as a graph prediction problem, where we predict a value at every node at a certain time using as input the historical values from neighboring nodes. In our setting, each sensor location v ∈ V is a node in an undirected graph. Assuming that air pollutants diffuse uniformly in all directions and exert their influence throughout our region of interest, in this case the greater Delhi region, we make the graph complete, where an edge exists between every pair of nodes. The end goal is to train a model that predicts at any node, the pollution level, measured in terms of the concentration of fine particulate matter PM2.5, at time t given one or more readings from neighboring locations prior to t. The first step is to interpolate the gaps in the data. We use a geostatistics model for this task, called the Spatio-temporal Hierarchical Model (STHM). Then we fit a cubic spline based on daily trends at each sensor location, and then finally train a Message-Passing Recurrent Neural Network (MPRNN) (Section 4.4) to predict residuals over the baseline. In order to account for the amount of influence based on the pairwise distances, we include the Euclidean distance between sensors as part of our feature embedding in our message-passing formulation. We test this model by predicting values at locations where sensors, and therefore ground truth information, are present, but the model is generalized enough to be used to predict at locations where there is no ground truth data available. If $y_{v,t}$ is the reading of the sensor at location v, at timestamp t, and ${\hat{y}}_{v,t}$ is our corresponding prediction, the prediction model aims to minimize the mean absolute percentage loss:

$${\rm{MAPE}} = \sum_v \sum_t \frac{|{\hat y}_{v,t} - y_{v,t}|}{y_{v,t}}$$

(1)
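As a concrete illustration of the two evaluation criteria, the sketch below computes MAPE and RMSE over paired observations and predictions. It uses the conventional normalized definitions (dividing by the number of samples and expressing MAPE in percent), which is an assumption here since Eq. (1) is written as a plain sum; the function names and sample values are hypothetical.

```python
import math

def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent."""
    terms = [abs(p - t) / t for t, p in zip(y_true, y_pred)]
    return 100.0 * sum(terms) / len(terms)

def rmse(y_true, y_pred):
    """Root mean squared error, in the units of the inputs."""
    squared = [(p - t) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical PM2.5 readings and predictions, for illustration only.
y_true = [100.0, 200.0, 50.0]
y_pred = [110.0, 180.0, 55.0]
print(mape(y_true, y_pred), rmse(y_true, y_pred))
```

Each prediction in this example is off by exactly 10% of the true value, so the MAPE is 10%, while the RMSE is dominated by the largest absolute miss. This is why the two criteria can rank locations differently when baseline concentrations vary.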

Our pollution forecasting model for estimating the PM_2.5 particulate matter concentration across space and time consists of three important steps. Given the variations in data availability across our pollution sensors, the first step of our method uses a standard Spatio-Temporal Hierarchical Model (STHM) to estimate the missing data. Our STHM model is a standard statistical modeling framework from geostatistics that combines multiple sources of information, accommodates missing values, and computes predictions in both space and time. Based on daily variation patterns observed at each of the pollution sensors, the second step in our method estimates a three-way cubic spline at each sensor location, one for each disjoint 8 h interval in a 24 h period (12 am to 8 am, 8 am to 4 pm and 4 pm to 12 am), representing three different patterns in the PM_2.5 variations. The cubic splines for each sensor represented a baseline level of PM_2.5 concentration. The cubic splines may provide a good approximation to the overall average daily variations across sensors but do not capture short term spatio-temporal variations represented by the residual errors in the baseline. The final step of our method is to train a Message-Passing Recurrent Neural Network (MPRNN) across the pollution monitoring points to estimate the residual errors from neighboring sensors. We will briefly describe the characteristics of our data and then explain the cubic spline and MPRNN methodology in this section. We refer the reader to the supplementary text for a detailed description of the STHM model.

Data

The data used for modeling the air pollution levels in Delhi was sourced from a combination of 32 local government monitors and a network of 28 low-cost sensors deployed by us in various locations of Delhi from May 2018 to May 2020. The average availability of these two sets of sensors is about 90 and 30% over the measured period, respectively. This disparity is attributed to a variety of factors such as disconnection for periodic necessary calibration, network outages, and periodic servicing of sensors. The low-cost sensors are calibrated against the government sensors through a longitudinal comparison study in which measurements are taken in proximity to the government monitoring centers. The sensor locations and their summary statistics are given in Supplementary Tables 1 and 2, and are shown visually in the box plots in Supplementary Fig. 1.

Cubic splines

We observe that on a daily basis, depending on the time of the day and the location, there is a low-frequency component that makes up an approximate “baseline level” of PM concentration. Based on this observation, we fit a piecewise polynomial function, called a spline, to model this low-frequency component. We divided a single day into a number of epochs and fit a spline for each epoch. Prior to implementing the cubic splines, we observed that the residual errors from the MPRNN model differ at different times of the day. We then proceeded to fit cubic splines based on the daily spatio-temporal patterns per sensor and per location. For example, if our prediction error follows a temporal pattern of, say, higher prediction error in the morning and lower in the afternoon, we can leverage this by fitting separate splines for morning and afternoon to subtract out this component. The spline can be of any order, but given our residual error patterns, we found that a piecewise cubic spline works best. Suppose at time t and location v, the raw PM value is given by y_v,t. Then, the piecewise spline to predict y, with time period p, is given by:

$${\hat{y}}_{p}(v,t)={\alpha }_{v,p}* {t}^{3}+{\beta }_{v,p}* {t}^{2}+{\kappa }_{v,p}* t+{\nu }_{v,p}$$

(2)

Note that the chosen parameters per sensor α_v,p, β_v,p, κ_v,p, ν_v,p, where p ∈ {“morning”, “afternoon”, “evening”}, depend on the patterns in our residual errors and are fit accordingly to minimize the root mean-squared residual error:

$${\rm{RMSE}}(v)=\mathop{\sum}\limits_{t}\mathop{\sum}\limits_{p}\sqrt{{(y(v,t)-{\hat{y}}_{p}(v,t))}^{2}}$$

(3)
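The per-window cubic fit of Eq. (2) can be sketched with an ordinary least-squares polynomial fit, which minimizes the squared residual within each window. This is a minimal illustration under several assumptions: t is taken to be the hour of day, each 8 h window gets an independent cubic with no continuity constraint at the window boundaries, and the data are synthetic. The function names `fit_window_cubics` and `evaluate_baseline` are hypothetical.

```python
import numpy as np

# The three disjoint 8 h windows named in the text.
WINDOWS = {"morning": (0, 8), "afternoon": (8, 16), "evening": (16, 24)}

def fit_window_cubics(hour_of_day, pm25):
    """Fit one cubic per window; returns {window: [alpha, beta, kappa, nu]}."""
    hour_of_day = np.asarray(hour_of_day, dtype=float)
    pm25 = np.asarray(pm25, dtype=float)
    coeffs = {}
    for name, (start, end) in WINDOWS.items():
        mask = (hour_of_day >= start) & (hour_of_day < end)
        # np.polyfit returns coefficients from highest to lowest power.
        coeffs[name] = np.polyfit(hour_of_day[mask], pm25[mask], deg=3)
    return coeffs

def evaluate_baseline(coeffs, hour_of_day):
    """Evaluate the piecewise baseline at the given hours of day."""
    hour_of_day = np.asarray(hour_of_day, dtype=float)
    baseline = np.empty_like(hour_of_day)
    for name, (start, end) in WINDOWS.items():
        mask = (hour_of_day >= start) & (hour_of_day < end)
        baseline[mask] = np.polyval(coeffs[name], hour_of_day[mask])
    return baseline

# Synthetic single-sensor data: 30 days of hourly readings (illustration only).
rng = np.random.default_rng(0)
hours = np.tile(np.arange(24.0), 30)
pm25 = 120.0 + 40.0 * np.sin(2 * np.pi * hours / 24.0) + rng.normal(scale=5.0, size=hours.size)

coeffs = fit_window_cubics(hours, pm25)
residual = pm25 - evaluate_baseline(coeffs, hours)
print(round(float(np.std(pm25)), 1), round(float(np.std(residual)), 1))
```

The `residual` series is what a downstream model would be trained to predict; adding the baseline back to a predicted residual recovers a concentration estimate.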

Message-passing recurrent neural network

MPRNN, based on refs. ^48,49, is a neural network architecture that is applied on a graph in order to predict values at each node in the graph. This approach enables us to incorporate spatial interactions between each pair of nodes as “messages” that are broadcast from every node to its neighbors. Each node has a modified version of a long short-term memory (LSTM) network that iterates between message-passing and the recurrent computations.

Suppose y_v,t is a quantity of interest at node v and time t, for which we would like to build a predictive model. Mathematically, we would like to learn a function ${{{\mathcal{F}}}}$ such that, ${y}_{v,t+1}={{{\mathcal{F}}}}({v}_{1},{y}_{{v}_{1},t},{v}_{2},{y}_{{v}_{2},t},\ldots ;{v}_{j}\in {{{\mathcal{V}}}})$ where the set ${{{\mathcal{V}}}}$ denotes the set of all the nodes in the graph. A recurrent neural network unit is assigned to each node in the graph, with each node v maintaining a hidden state h_v,t at time t. Through a message-passing phase and a time-recurrent phase, our model infers the next hidden state, h_v,t+1 from which the PM value at v is decoded. A message-passing operation allows one segment to observe the hidden state of its neighboring segments.

The computation proceeds in five steps, as five layers of the neural network. In the first phase, the observation phase, the input observations ${Y}_{t}=\{{y}_{v,t}| v\in {{{\mathcal{V}}}}\}$ at time t are encoded into h_v,t by the observation operation O_v. In the second and third phases, one or more iterations of messaging (M) and updating (U) operations are performed to propagate the observations in the graph. In the fourth phase, for each node, a time-recurrent operator T_v utilizing an LSTM unit takes as input the final hidden state h_v,t and predicts the next hidden state h_v,t+1. The final phase is the readout operation R_v, which decodes the hidden state to produce the output value to be predicted ${\hat{y}}_{v,t+1}$. These five steps are shown below. The message function takes as input the hidden states of a pair of nodes v and n and the Euclidean distance between them, d_v,n as the influence of the pollution at a given location on the pollution at another location would depend on the distance between them. Hence, we include the distance in the embedding.

$${h}_{v,t}={O}_{v}({h}_{v,t-1},{y}_{v,t})$$

(4)

$${m}_{v,t}=\mathop{\sum}\limits_{n\in {{{\mathcal{V}}}}\setminus \{v\}}M({h}_{v,t},{h}_{n,t},{d}_{v,n})$$

(5)

$${h}_{v,t}=U({h}_{v,t},{m}_{v,t})$$

(6)

$${h}_{v,t+1}={T}_{v}({h}_{v,t})$$

(7)

$${\hat{y}}_{v,t+1}={R}_{v}({h}_{v,t+1})$$

(8)

For a selection of nodes ${{{\mathcal{W}}}}$ in the graph, the components of the model $\{{O}_{w},M,U,{T}_{w},{R}_{w},| w\in {{{\mathcal{W}}}}\}$ are defined. During inference, the states ${H}_{t}=\{{h}_{w,t}| w\in {{{\mathcal{W}}}}\}$ are maintained at each time step. The hidden state for each segment is initialized at t = 0 randomly during training and evaluation ${h}_{v,0} \sim {{{\mathcal{N}}}}(0,1)$.
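To make the data flow of Eqs. (4)–(8) concrete, the following NumPy sketch runs one forward time step on a small complete graph. It is not the authors' implementation, which uses the Deep Graph Library and PyTorch: here the weights are random and untrained, all nodes share one set of weights instead of per-node operators, a single message-passing iteration is performed, and a plain tanh layer stands in for the LSTM unit. All names and sizes are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
NUM_NODES, HIDDEN = 4, 8  # hypothetical sizes

# Random, untrained weights shared by all nodes (a simplification).
W_obs = rng.normal(scale=0.1, size=(HIDDEN + 1, HIDDEN))
W_msg = rng.normal(scale=0.1, size=(2 * HIDDEN + 1, HIDDEN))
W_upd = rng.normal(scale=0.1, size=(2 * HIDDEN, HIDDEN))
W_time = rng.normal(scale=0.1, size=(HIDDEN, HIDDEN))
w_read = rng.normal(scale=0.1, size=HIDDEN)

def mprnn_step(h_prev, y_t, dist):
    """One forward step: observe, message, update, recur, read out."""
    num_nodes = h_prev.shape[0]
    # Eq. (4): encode the previous hidden state and the current reading.
    h = np.tanh(np.concatenate([h_prev, y_t[:, None]], axis=1) @ W_obs)
    # Eq. (5): sum messages from every other node, including pairwise distance.
    m = np.zeros_like(h)
    for v in range(num_nodes):
        for n in range(num_nodes):
            if n != v:
                pair = np.concatenate([h[v], h[n], [dist[v, n]]])
                m[v] += np.tanh(pair @ W_msg)
    # Eq. (6): update each hidden state with its aggregated message.
    h = np.tanh(np.concatenate([h, m], axis=1) @ W_upd)
    # Eq. (7): time recurrence (a tanh layer standing in for the LSTM).
    h_next = np.tanh(h @ W_time)
    # Eq. (8): read out one predicted value per node.
    y_hat = h_next @ w_read
    return h_next, y_hat

# Hypothetical sensor coordinates and pairwise Euclidean distances.
coords = rng.uniform(0.0, 30.0, size=(NUM_NODES, 2))
dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)

h0 = rng.normal(size=(NUM_NODES, HIDDEN))  # random initial state, as in the text
y_t = rng.normal(size=NUM_NODES)           # stand-in for residuals at time t
h1, y_hat = mprnn_step(h0, y_t, dist)
print(h1.shape, y_hat.shape)
```

Because the graph is complete, the message loop costs on the order of the square of the number of nodes per step, which is modest for a network of 60 monitors.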

Training and validation

We used the data from May 1, 2018, to Nov 1, 2019, a period of 18 months, as the training period. The training set comprised 166,979 samples from our low-cost sensor network and 371,806 from the government network, for a total of 538,785 samples. The model was trained at each sensor location, using as input the data from all the other monitors except itself, over the entire training period. We used the Adam optimizer⁵⁰ with a learning rate of 0.001, and ran the training for 30 epochs to ensure a robust and well-trained model. To validate the model, we used the remaining 6 months of data, from Nov 1, 2019, to May 1, 2020. The number of ground truth samples available in this period was 20,408 and 91,493 in the low-cost network and government network, respectively, for a total of 111,901 samples. However, only 12 out of the 28 low-cost sensors were operational in the testing phase, since many of them had not been serviced properly, partly owing to the COVID-19 pandemic. The testing error reported under Results (§2), therefore, shows the predictions tested at 12 low-cost sensor locations and 32 government monitors, a total of 44 locations combined. Further, to understand the implications of having less data available during training, we evaluated our model as shown in Supplementary Table 3 and found that with less than a year of training data, our model’s performance decreases significantly, as seasonal trends are not well captured.

Implementation

The MPRNN is implemented using the Deep Graph Library⁵¹ and PyTorch⁵² in Python. The model diagram is shown in Fig. 3.

**Fig. 3: Message passing recurrent neural network for pollution monitoring in Delhi.**

Baselines

We contrast our combined model with two alternative modeling approaches in order to set a baseline to benchmark the MPRNN model performance. The first one is the STHM itself, a state-of-the-art spatio-temporal modeling methodology. When the STHM is used solely for the prediction, it performs poorly, as it does not model unknown non-linear spatial dependencies due to dispersion. The second baseline is an alternative neural network formulation that collects information from a specified number (K) of nearest neighbors to a location L, and feeds them into a trained recurrent neural network, to predict the value at L. Unlike the MPRNN, this model does not account for explicit spatial influence between every pair of sensors, thus allowing us to see how a more simplified multi-variate non-linear model might perform. We call this model the k-Nearest Neighbor (k-NN) Spatial Neural Network.
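The input construction for the k-NN baseline can be sketched as follows: rank all sensors by distance to the target location, keep the K closest, and concatenate their recent readings into one feature vector for a recurrent model. This is an illustrative sketch only; it assumes planar coordinates, and the function names, sensor ids, and readings are hypothetical. The recurrent network itself is omitted.

```python
import math

def k_nearest_neighbors(target, sensors, k):
    """Ids of the k sensors closest to the target (x, y), nearest first."""
    ranked = sorted(sensors, key=lambda sid: math.dist(target, sensors[sid]))
    return ranked[:k]

def knn_input_features(target, sensors, history, k):
    """Concatenate the recent readings of the k nearest sensors."""
    features = []
    for sid in k_nearest_neighbors(target, sensors, k):
        features.extend(history[sid])
    return features

# Hypothetical sensor coordinates and two most recent hourly readings each.
sensors = {"s1": (0.0, 0.0), "s2": (3.0, 4.0), "s3": (10.0, 0.0)}
history = {"s1": [90.0, 95.0], "s2": [120.0, 118.0], "s3": [60.0, 62.0]}

target = (2.0, 2.0)
neighbors = k_nearest_neighbors(target, sensors, k=2)
features = knn_input_features(target, sensors, history, k=2)
print(neighbors, features)
```

Unlike the message-passing formulation, the distances here only decide which sensors are included; they are not passed to the model as features.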

Data availability

The data that supports the findings of this study comprises two parts—the PM2.5 data from the government monitors and the data collected from our low-cost sensor network. The former is public data and can be accessed here⁵³. The data can also be provided by the authors upon request. The latter is third-party data and the authors are bound by a confidentiality agreement with Kaiterra, the makers of the low-cost sensors, and can only be made available for confidential peer review, if requested by reviewers, within the terms of the data use agreement and if compliant with ethical and legal requirements.

Code availability

All the relevant code can be obtained upon request from the corresponding author. The code is also available on GitHub: https://github.com/shivariyer/epod-nyu-delhi-pollution.

References

Shaddick, G., Thomas, M., Mudu, P., Ruggeri, G. & Gumy, S. Half the world’s population are exposed to increasing air pollution. NPJ Clim. Atmos. Sci. 3, 1–5 (2020).
Article Google Scholar
Rao, N. D., Kiesewetter, G., Min, J., Pachauri, S. & Wagner, F. Household contributions to and impacts from air pollution in India. Nat. Sustain. 4, 1–9 (2021).
Geng, G. et al. Drivers of PM2.5 air pollution deaths in China 2002–2017. Nat. Geosci. 14, 645–650 (2021).
Liu, H.-Y., Schneider, P., Haugen, R. & Vogt, M. Performance assessment of a low-cost PM2.5 sensor for a near four-month period in Oslo, Norway. Atmosphere 10, 41 (2019).
Liu, X. et al. Low-cost sensors as an alternative for long-term air quality monitoring. Environ. Res. 185, 109438 (2020).
Giordano, M. R. et al. From low-cost sensors to high-quality data: A summary of challenges and best practices for effectively calibrating low-cost particulate matter mass sensors. J. Aerosol Sci. 158, 105833 (2021).
Tryner, J. et al. Design and testing of a low-cost sensor and sampling platform for indoor air quality. Building Environ. 206, 108398 (2021).
Prakash, J. et al. Real-time source apportionment of fine particle inorganic and organic constituents at an urban site in Delhi city: An IoT-based approach. Atmospheric Pollution Res. 12, 101206 (2021).
Bi, J. et al. Publicly available low-cost sensor measurements for PM2.5 exposure modeling: Guidance for monitor deployment and data selection. Environ. Int. 158, 106897 (2022).
Zusman, M. et al. Calibration of low-cost particulate matter sensors: Model development for a multi-city epidemiological study. Environ. Int. 134, 105329 (2020).
Mahajan, S. & Kumar, P. Evaluation of low-cost sensors for quantitative personal exposure monitoring. Sustainable Cities Soc. 57, 102076 (2020).
Spyropoulos, G. C., Nastos, P. T. & Moustris, K. P. Performance of Aether low-cost sensor device for air pollution measurements in urban environments. Accuracy evaluation applying the air quality index (AQI). Atmosphere 12, 1246 (2021).
Chu, H.-J., Ali, M. Z. & He, Y.-C. Spatial calibration and PM2.5 mapping of low-cost air quality sensors. Sci. Rep. 10, 1–11 (2020).
Jiao, W. et al. Community air sensor network (CAIRSENSE) project: Evaluation of low-cost sensor performance in a suburban environment in the southeastern United States. Atmos. Meas. Tech. 9, 5281–5292 (2016).
Morawska, L. et al. Applications of low-cost sensing technologies for air quality monitoring and exposure assessment: How far have they gone? Environ. Int. 116, 286–299 (2018).
Stavroulas, I. et al. Field evaluation of low-cost PM sensors (Purple Air PA-II) under variable urban air quality conditions, in Greece. Atmosphere 11, 926 (2020).
Tancev, G. & Pascale, C. The relocation problem of field calibrated low-cost sensor systems in air quality monitoring: a sampling bias. Sensors 20, 6198 (2020).
Kim, H. S. et al. Development of a daily PM10 and PM2.5 prediction system using a deep long short-term memory neural network model. Atmos. Chem. Phys. 19, 12935–12951 (2019).
Kalajdjieski, J., Mirceva, G. & Kalajdziski, S. Attention models for PM2.5 prediction. In 2020 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT) 1–8 (IEEE, 2020).
Lin, L., Chen, C.-Y., Yang, H.-Y., Xu, Z. & Fang, S.-H. Dynamic system approach for improved PM2.5 prediction in Taiwan. IEEE Access 8, 210910–210921 (2020).
Pérez, P., Trier, A. & Reyes, J. Prediction of PM2.5 concentrations several hours in advance using neural networks in Santiago, Chile. Atmos. Environ. 34, 1189–1196 (2000).
Song, L., Pang, S., Longley, I., Olivares, G. & Sarrafzadeh, A. Spatio-temporal PM2.5 prediction by spatial data aided incremental support vector regression. In 2014 International Joint Conference on Neural Networks (IJCNN) 623–630 (IEEE, 2014).
Wang, Y., Wang, H., Chang, S. & Avram, A. Prediction of daily PM2.5 concentration in China using partial differential equations. PLoS One 13, e0197666 (2018).
Qin, D. et al. A novel combined prediction scheme based on CNN and LSTM for urban PM2.5 concentration. IEEE Access 7, 20050–20059 (2019).
Liu, T. et al. Seasonal impact of regional outdoor biomass burning on air pollution in three Indian cities: Delhi, Bengaluru, and Pune. Atmos. Environ. 172, 83–92 (2018).
Chambliss, S. E. et al. Local- and regional-scale racial and ethnic disparities in air pollution determined by long-term mobile monitoring. Proc. Natl Acad. Sci. USA 118, e2109249118 (2021).
Liang, Y. et al. Wildfire smoke impacts on indoor air quality assessed using crowdsourced data in California. Proc. Natl Acad. Sci. USA 118, e2106478118 (2021).
Ferraro, P. J. & Agrawal, A. Synthesizing evidence in sustainability science through harmonized experiments: Community monitoring in common pool resources. Proc. Natl Acad. Sci. USA 118, e2106489118 (2021).
Ludescher, J. et al. Network-based forecasting of climate phenomena. Proc. Natl Acad. Sci. USA 118, e1922872118 (2021).
Clements, A. L. et al. Low-cost air quality monitoring tools: From research to practice (a workshop summary). Sensors 17, 2478 (2017).
Lin, C. et al. Evaluation and calibration of Aeroqual Series 500 portable gas sensors for accurate measurement of ambient ozone and nitrogen dioxide. Atmos. Environ. 100, 111–116 (2015).
Shusterman, A. A. et al. The Berkeley atmospheric CO2 observation network: Initial evaluation. Atmos. Chem. Phys. 16, 13449–13463 (2016).
Moltchanov, S. et al. On the feasibility of measuring urban air pollution by wireless distributed sensor networks. Sci. Total Environ. 502, 537–547 (2015).
Sun, L. et al. Development and application of a next generation air sensor network for the Hong Kong Marathon 2015 air quality monitoring. Sensors 16, 211 (2016).
Tsujita, W., Yoshino, A., Ishida, H. & Moriizumi, T. Gas sensor network for air-pollution monitoring. Sensors Actuators B: Chem. 110, 304–311 (2005).
Gao, M., Cao, J. & Seto, E. A distributed network of low-cost continuous reading sensors to measure spatiotemporal variations of PM2.5 in Xi'an, China. Environ. Pollution 199, 56–65 (2015).
Xie, X. et al. A review of urban air pollution monitoring and exposure assessment methods. ISPRS Int. J. Geo-Inform. 6, 389 (2017).
Jerrett, M. et al. A review and evaluation of intraurban air pollution exposure models. J. Exposure Sci. Environ. Epidemiol. 15, 185 (2005).
Yeh, C. et al. Using publicly available satellite imagery and deep learning to understand economic well-being in Africa. Nat. Commun. 11, 1–11 (2020).
U.S. Environmental Protection Agency, Office of Air Quality Planning and Standards. Air Quality Assessment Division, Research Triangle Park, NC (2021).
Technologies, K. Laser egg. kaiterra.com (2022).
Harigovind, A. Dust management committee recommends air quality monitors at all large construction sites in Delhi. https://indianexpress.com/article/cities/delhi/dust-management-committee-recommends-air-quality-monitors-at-large-delhi-construction-sites-7437599/ (2021).
Somvanshi, A. Delhi’s air quality and number games. https://www.downtoearth.org.in/blog/air/delhi-s-air-quality-and-number-games-76214 (2021).
Qian, H. et al. Air pollution reduction and climate co-benefits in China’s industries. Nat. Sustain. 4, 417–425 (2021).
Tibrewal, K. & Venkataraman, C. Climate co-benefits of air quality and clean energy policy in India. Nat. Sustain. 4, 305–313 (2021).
Johnson, C. How Kaiterra ensures that Sensedge devices are accurate and correctly calibrated. https://learn.kaiterra.com/en/resources/how-sensedge-devices-are-accurate-and-correctly-calibrated (2022).
Technologies, K. Does the laser egg need to be recalibrated? https://support.kaiterra.com/does-the-laser-egg-need-to-be-recalibrated (2022).
Gilmer, J., Schoenholz, S. S., Riley, P. F., Vinyals, O. & Dahl, G. E. Neural message passing for quantum chemistry. Proceedings of the 34th International Conference on Machine Learning (ICML). Vol. 70, 1263–1272 (2017).
Iyer, S. R., An, U. & Subramanian, L. Forecasting sparse traffic congestion patterns using message-passing RNNs. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 3772–3776 (2020).
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. International Conference on Learning Representations (ICLR). (2015).
Wang, M. et al. Deep graph library: A graph-centric, highly-performant package for graph neural networks. Preprint at https://arxiv.org/abs/1909.01315 (2019).
Paszke, A. et al. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems 32 (eds. Wallach, H., Larochelle, H., Beygelzimer, A., d’Alché-Buc, F., Fox, E. & Garnett, R.) 8024–8035 (Curran Associates, Inc., 2019).
Central Pollution Control Board (CPCB). Central Control Room for Air Quality Management - All India. https://app.cpcbccr.com/ccr/#/caaqm-dashboard-all/caaqm-landing/caaqm-comparison-data (2022).

Acknowledgements

The work done by the authors Shiva Iyer, Ananth Balashankar, and Lakshminarayanan Subramanian in this paper was supported by funding from industrial affiliates in the NYUWIRELESS research group (https://www.nyuwireless.com), which funded Shiva Iyer in part as well as the air quality sensors used in the study. Shiva Iyer was also funded in part by an NSF Grant (award number OAC-2004572) titled “A Data-informed Framework for the Representation of Sub-grid Scale Gravity Waves to Improve Climate Prediction”. Mr. Balashankar is a Ph.D. student at New York University and is also funded in part by the Google Student Research Advising Program. We acknowledge our collaboration with Kaiterra for their efforts in the development and installation of the low-cost sensors. We acknowledge the data availability from CPCB on their public portal. We also acknowledge the contributions of Ulzee An, a former master’s student, in writing code for older baseline models. Any opinions, findings and conclusions, or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of NYUWIRELESS or Kaiterra.

Author information

Authors and Affiliations

Department of Computer Science, New York University, New York, NY, USA
Shiva R. Iyer, Ananth Balashankar & Lakshminarayanan Subramanian
Swiss Data Science Center, ETH Zurich, Zurich, Switzerland
William H. Aeberhard
Columbia University, New York, NY, USA
Sujoy Bhattacharyya
Evidence for Policy Design (EPoD) at the Institute for Financial Management and Research (IFMR), New Delhi, New Delhi, India
Sujoy Bhattacharyya & Giuditta Rusconi
State Secretariat for Education, Research and Innovation (SERI), Bern, Switzerland
Giuditta Rusconi
Kai Air Monitoring Pvt Ltd, Gautam Buddha Nagar, UP, India
Lejo Jose & Nita Soans
Department of Economics, University of Chicago, Chicago, IL, USA
Anant Sudarshan
Department of Economics, Yale University, New Haven, CT, USA
Rohini Pande

Authors

Shiva R. Iyer
Ananth Balashankar
William H. Aeberhard
Sujoy Bhattacharyya
Giuditta Rusconi
Lejo Jose
Nita Soans
Anant Sudarshan
Rohini Pande
Lakshminarayanan Subramanian

Contributions

S.I., A.S., R.P., and L.S. contributed to problem conceptualization and design. S.I., A.B., W.A., and L.S. contributed to the spatio-temporal models. S.I., A.B., and W.A. contributed to the code, data analysis, and visualizations. S.B. and G.R. contributed to the sensor network deployment and data gathering efforts in Delhi under the guidance of R.P. S.I., A.B., W.A., R.P., A.S., and L.S. helped in writing and editing various sections of the paper.

Corresponding author

Correspondence to Lakshminarayanan Subramanian.

Ethics declarations

Competing interests

Prof. Subramanian declares no competing non-financial interests but the following competing financial interests: Prof. Subramanian is a co-founder of Entrupy Inc, Velai Inc, and Gaius Networks Inc and has served as a consultant for the World Bank and the Governance Lab. Dr. Subramanian reports that Velai Inc broadly works in the area of socio-economic predictive models. All other authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Iyer, S.R., Balashankar, A., Aeberhard, W.H. et al. Modeling fine-grained spatio-temporal pollution maps with low-cost sensors. npj Clim Atmos Sci 5, 76 (2022). https://doi.org/10.1038/s41612-022-00293-z

Received: 30 December 2021
Accepted: 30 August 2022
Published: 12 October 2022
DOI: https://doi.org/10.1038/s41612-022-00293-z