Spatial calibration and PM2.5 mapping of low-cost air quality sensors

Chu, Hone-Jay; Ali, Muhammad Zeeshan; He, Yu-Chen

doi:10.1038/s41598-020-79064-w

Download PDF

Article
Open access
Published: 16 December 2020

Spatial calibration and PM_2.5 mapping of low-cost air quality sensors

Hone-Jay Chu¹,
Muhammad Zeeshan Ali¹ &
Yu-Chen He¹

Scientific Reports volume 10, Article number: 22079 (2020) Cite this article

5075 Accesses
29 Citations
Metrics details

Subjects

Abstract

The data quality of low-cost sensors has received considerable attention and has also led to PM_2.5 warnings. However, the calibration of low-cost sensor measurements in an environment with high relative humidity is critical. This study proposes an efficient calibration and mapping approach based on real-time spatial model. The study carried out spatial calibration, which automatically collected measurements of low-cost sensors and the regulatory stations, and investigated the spatial varying pattern of the calibrated low-cost sensor data. The low-cost PM_2.5 sensors are spatially calibrated based on reference-grade measurements at regulatory stations. Results showed that the proposed spatial regression approach can explain the variability of the biases from the low-cost sensors with an R-square value of 0.94. The spatial calibration and mapping algorithm can improve the bias and decrease to 39% of the RMSE when compared to the nonspatial calibration model. This spatial calibration and real-time mapping approach provide a useful way for local communities and governmental agencies to adjust the consistency of the sensor network for improved air quality monitoring and assessment.

SensEURCity: A multi-city air quality dataset collected for 2020/2021 using open low-cost sensor systems

Article Open access 26 May 2023

Martine Van Poppel, Philipp Schneider, … Rolf Haugen

Enhancing the reliability of particulate matter sensing by multivariate Tobit model using weather and air quality data

Article Open access 12 August 2023

Wan-Sik Won, Jinhong Noh, … Yong-Jin Yoon

Long-term field comparison of multiple low-cost particulate matter sensors in an outdoor urban environment

Article Open access 16 May 2019

Florentin M. J. Bulot, Steven J. Johnston, … Matthew Loxham

Introduction

Air pollution is a severe global problem, especially in developing countries. A major air pollutant in urban environments is fine particulate matter (PM_2.5), which affects human health negatively. However, the traditional existing air quality monitoring networks are sparsely deployed and lack the measurement density, because the installation and maintenance of air quality monitoring instruments are expensive. In the race to develop high-resolution, spatiotemporal air pollutant monitoring systems, low-cost air quality sensors are promising supplements to regulatory monitors for PM_2.5 exposure assessment^1,2. Low-cost sensors have been used to collect real-time high-density air pollution data^3,4. Low-cost sensors for PM_2.5 were developed from Internet of Things (IoT)⁵. Investigators can also deploy additional sensors to increase spatial coverage of air quality monitoring network⁶. Moreover, low-cost sensors can gather air quality information of the community within real time at any location. The sensors are potentially easy to use and maintain because they require less energy and space to operate^2,7.

Low-cost sensors have been proposed to stand alone or as complementary components of existing air quality monitoring regulatory networks, which measure air pollution concentrations. However, previous studies have pointed out the inconsistency between low-cost sensors and high-quality regulatory instruments^7,8,9. Low-cost observations may contain uncertainties mainly due to aerosol schemes, temperature, and high relative humidity^10,11. The sensors may also indicate bias, as the models are dependent on location, hygroscopic growth, and specific range of relative humidity^12,13. Each sensor type has distinct characteristics, and many factors affect light scattering, including particle size, shape, composition, and relative humidity¹¹. Moreover, the particle number and mass concentrations significantly increase in high relative humidity¹².

To minimize low-cost sensor bias and uncertainties, calibration is essential for producing validated data. Calibration should be processed before, during, and after the data collection¹⁴. The response of low-cost sensors could then be compared to the response of a reference instrument or a reference system. Even though low-cost sensor data attain low systematic bias after calibration, their precision is still not comparable to that of reference-grade measurements. Holstius⁸ et al. (2014) calibrated PM_2.5 from a reference instrument with Federal Equivalent Method (FEM) status. However, keeping the regional sensors well-calibrated during deployment is a challenge⁹. Few approaches have been used to create field calibration formulas^7,15. For example, linear models are used to adjust the raw data in certain cases¹⁶. Furthermore, nonlinear equations or machine learning methods are necessary to improve lower-cost sensor performance for air quality monitoring^4,15,17,18. The reduction of measurement errors of large-scale low-cost sensor data is difficult with current multivariate calibration models². To avoid regional misunderstanding of the measurements of low-cost air quality sensors, the development of a spatial calibration procedure is necessary. Thus, in the current study, we propose a real-time, regional, and simple way for low-cost sensor calibration based on reference-grade measurements at regulatory stations.

The objectives of the study are to develop the spatial varying relationship between the low-cost sensors and regulatory stations for calibration; to calibrate regional, low-cost air quality sensors against regulatory stations at one time slice; and to estimate the reliable PM_2.5 concentration map from calibrated low-cost air quality sensors. The real-time calibration function would vary with the specific location. As a basis for the spatial calibration model, the function was assumed to be related to locations, and the coefficient estimation was carried out in spatial regression. Moreover, regional PM_2.5 concentration estimations were conducted after the calibration. Hence, the spatial calibrated concentration from low-cost sensors would be able to identify the reliable regional pollution hotspots.

Materials

As the program for the low-cost PM_2.5 sensor was launched in 2015, cooperation between the Academia Sinica and businesses, e.g., Edimax, led to the installation of thousands of boxes in Taiwan wherein local communities have volunteered to install low-cost PM_2.5 sensors. The majority of air quality detection sensors, called AirBox, used the Realtek Ameba development board with the PMS5003 optical particulate matter sensor⁵. In this case, 2963 low-cost sensors were set-up (Fig. 1). The data are available from https://pm25.lass-net.org/. The AirBox PM_2.5 observations along with the environmental variables, e.g., temperature (°C), and relative humidity (%), were updated roughly every 5 min.

**Figure 1 please help to update fig 1 from the attachment**

The Taiwanese Environmental Protection Agency (TWEPA) has been regularly recording the air quality and meteorological data throughout Taiwan. Since August 2005, the TWEPA has completed the installation of 76 automatic monitoring instruments for PM_2.5 (Fig. 1). The current study used the hourly PM_2.5 data obtained by the TWEPA stations that complied with the regulatory air monitoring procedures. The data were accessed from https://opendata.epa.gov.tw/Data/Contents/ATM00625/.

In this case study, the PM_2.5 hourly data obtained by the TWEPA’s Air Monitoring Network and low-cost sensor real-time data were obtained for the fine aerosol concentration in Taiwan. The time-slice at 12:00 pm (UTC), 2020-02-24 is used as a case study. At that time, the relative humidity was high, and the air quality was poor from high PM_2.5 concentrations in the western part of Taiwan due to weak diffusion condition. The average relative humidity is 82.1% and average temperature is 22 °C in the low-cost sensors. In high relative humidity, problematic observations from the low-cost sensor are expected to be found. Temperatures between 17 and 27 °C were mostly insignificant factors of sensor bias in the study area⁷.

Method

The main steps involved calibrating the low-cost sensors and estimating the accurate map of PM_2.5 (Fig. 2). In the preprocessing of calibration (adjustment) model, the nearest neighbor data pair from low-cost sensor and regulatory station is used for collocation. Subsequently, the nonspatial and spatial calibration models are applied. After calibration, the regional PM_2.5 concentrations from the sensor data are estimated.

Suppose that $n$ regulatory observations are found in the dataset for low-cost sensor calibration in the time slice. Each observation consists of a response variable ${Y}_{i}$, and $B$-dimensional covariates ${X}_{i}={\left({x}_{1i}, \dots ,{x}_{Bi}\right)}^{T}$, $i=1,\dots ,n$. Here, ${Y}_{i}$ and ${X}_{i}$ are assumed to be random variables, and $E\left({Y}_{i}|{X}_{i}\right)$ is the conditional expectation of ${Y}_{i}$ given ${X}_{i}$.

$$E\left({Y}_{i}|{X}_{i}\right)={\beta }_{0}+{\sum }_{b=1}^{B}{\beta }_{b}{x}_{bi}$$

(1)

Here, the calibration model considers ${x}_{1i}$, which is the PM_2.5 concentration from the collocated low-cost sensors (nearest neighbors of regulatory observations), and ${Y}_{i}$ is the PM_2.5 concentration of regulatory sensors from TWEPA. The low-cost sensor calibration from nonspatial calibration model (global calibration: a single coefficient set ${\beta }_{0}, { \beta }_{1}$ for all sensors) is expressed as.

$$E\left({Y}_{i}|{X}_{i}\right)={\beta }_{0}+{\beta }_{1}{x}_{1i}.$$

(2)

The spatial calibration model is defined from a kernel-based varying-coefficient model¹⁹, which is spatial regression²⁰. The spatial calibration model requires the 2D coordinates of each observation as geographical information. Let $({{u}_{i},v}_{i})$ be the coordinate of the $i$th observation.

$$E\left({Y}_{i}|{X}_{i}, ({u}_{i},{v}_{i})\right)={\beta }_{0}\left({u}_{i},{v}_{i}\right)+{\sum }_{b=1}^{B}{\beta }_{b}\left({u}_{i},{v}_{i}\right){x}_{bi.}$$

(3)

The spatial regression coefficients, ${\beta }_{0}\left({u}_{i},{v}_{i}\right),\dots ,{\beta }_{B}\left({u}_{i},{v}_{i}\right)$, are estimated by weighted least-square method wherein the weights are evaluated by the distance between coordinates of the observed data $({u}_{i}, {v}_{i})$ and the neighborhood coordinate. Here, the PM_2.5 of the regulatory station is the response variable and the PM_2.5 of the collocated low-cost sensor is the covariate. Therefore, the spatial calibration model (local calibration: individual coefficient set each low-cost sensor) is expressed as.

$$E\left({Y}_{i}|{X}_{i}, ({u}_{i},{v}_{i})\right)={\beta }_{0}\left({u}_{i},{v}_{i}\right)+{\beta }_{1}\left({u}_{i},{v}_{i}\right){x}_{1i}.$$

(4)

In the spatial adjustment, the estimated spatial coefficient ${\widehat{\beta }}_{k}\left({u}_{i},{v}_{i}\right)$ at each observation i is derived from weighted least-squares

$${\widehat{\beta }}_{k}\left({u}_{i},{v}_{i}\right)={[{X}^{T}\mathrm{W}\left({u}_{i},{v}_{i}\right)X]}^{-1}{X}^{T}\mathrm{W}\left({u}_{i},{v}_{i}\right)Y,$$

(5)

where $\mathrm{W}\left({u}_{i},{v}_{i}\right)$ is a weight matrix based on the kernel function. The kernel distances are defined as Euclidian and Gaussian function. The Gaussian distance decay-based function is one of the typical functions of spatial autocorrelation. The geographical distance is quoted in meters. The elements in a weight matrix ${\mathrm{W}}_{ij}$ is the weighted function between observations i and j. The weight element can be computed as a kernel function

$${\mathrm{W}}_{ij}=exp\left(-\frac{{d}_{ij}^{2}}{{b}^{2}}\right),$$

(6)

where ${b}$ is a nonnegative parameter known as bandwidth, and ${d}_{ij}$ is the Gaussian distance decay-based function based on the geographical distance. The optimal bandwidth is determined by the criteria, i.e., cross validation. Meanwhile, the model residual (${e}_{i}$) at observation i is defined as

$${e}_{i}={Y}_{i}-{Y}_{i}^{^{\prime}},$$

(7)

where ${Y}_{i}$ is the observed concentration from regulatory station i, and ${Y}_{i}^{^{\prime}}$ is the calibrated concentration of the low-cost sensors near regulatory station i. To evaluate the model performance, the common measures, R² and RMSEs, are used based on the observation data and the estimated values at the regulatory stations.

Furthermore, the inverse distance weighting (IDW) method is a straightforward and low-computational approach for spatial interpolation procedures of the PM_2.5 concentration map. The IDW method is used in this study to generate a 2 km-resolution PM_2.5 concentration map after spatial calibration. The weights of the observations in the IDW are based on the inverse distances of the unknown point and observations. The parameter power value is 2 as the inverse distance squared weighted interpolation.

Results

Data description

The raw PM_2.5 concentrations from (a) low-cost sensor and (b) regulatory stations and their corresponding histograms (c and d) are shown in Fig. 3. The peaks of raw low-cost sensor data distribution are approximately over 20 and 50–60 μg/m³, but the peaks of raw regulatory station data are 10–20, 40–50 μg/m³, respectively. In addition, the average values of low-cost sensors and regulatory sensors are 51.9 and 36.7 μg/m³, and the standard derivations are 26.1 and 17.5 μg/m³, respectively. The average values of the low-cost sensor data are double than those of the regulatory sensors. Furthermore, the variance of low-cost sensor data is significantly larger than the EPA monitoring data (681 and 306 (μg/m³)² in the low-cost sensor and EPA data, respectively). The low-cost sensor data calibration is needed in the condition. Most PM_2.5 measurements from raw low-cost sensors and regulatory stations showed extremely high correlation with the time series, and these can be used to identify the PM_2.5 pollution hotspots⁷. However, the correlation of low-cost sensors and regulatory sensor data is only 0.8 from the collocated stations in this time slice.

Spatial and nonspatial calibration models

The nonspatial and spatial calibration models have R-square values of 0.64 and 0.94, respectively (Fig. 4). The RMSE of the low-cost sensor is modified from 17.7 to 10.5 μg/m³ after calibration from the nonspatial model, but can be improved to 4.1 μg/m³ using the spatial calibration model. Considering the spatial calibration model, the model performance improved, that is, the spatial scheme can reduce the inconsistency of low-cost sensors and the neighboring regulatory observations. Figure 5 shows the spatial pattern and data distribution of calibrated results based on nonspatial and spatial calibration models. The data distribution considering the nonspatial model is similar to the original low-cost sensor data. After calibrating by spatial model, the peaks of data distribution are 10–20, and 40–50 μg/m³, respectively, and the standard derivation decreased from 26.1 to 18.0 μg/m³. The distribution of calibrated data in the spatial model is transformed to similar data distribution of regulatory stations. The result shows that the spatial calibration model performed better than the nonspatial calibration model.

The spatial calibration model is a local linear transformation used to model spatially varying relationships²⁰. However, the nonspatial approach, i.e., global linear transformation, can lead to biased estimations of regression model parameters due to the sample outliers. The proposed spatial calibration model is based on a weighted local fitting approximation to the function being estimated. The approach adopted in this study can identify a nonlinear and spatial varying relation between a pair of low-cost sensors and regulatory stations. The spatial calibrated data can be used directly to estimate a reliable air quality map.

Spatial calibration coefficient

Figure 6 shows the spatial coefficients in the spatial calibration model and indicates that the coefficient varies with locations. When considering spatial calibration model, the average slope is 0.33, but the average intercept is 21.4 (the slope is 0.58 and the intercept is 9.4 in the nonspatial calibration model). The spatial calibration model can consider the effects of a spatially heterogenous pattern and explore spatial non-stationarity in the sensor calibration by allowing regression coefficients to vary spatially. Furthermore, the system in the sensor calibration is local linear with weights because sensor relationships exhibit spatially heterogeneous patterns.

Model residual plots

Examining residuals is a key part of statistical modeling. Figures 7 and 8 show the residual plots, residual histograms, and normal probability plot of residuals, respectively. The residuals of the nonspatial calibration model are between − 16 and 36 μg/m³, whereas the residuals of the spatial calibration model are between − 11 and 14 μg/m³. In Fig. 7, the overestimations of the PM_2.5 concentration values exist in most areas of Taiwan, but the underestimations of the PM_2.5 concentration values exist at the hotspots in the central and southern regions²¹. Hence, the spatial calibration model reduces the bias when compared to the nonspatial calibration model. The model, i.e., traditional linear regression, cannot work with the regional sensor data calibration, because the linear regression model is easily failed by spatial heterogeneity. After performing the calibration using spatial model, the results show the most random and normally distributed residual patterns in the spatial model (Fig. 8). The normal probability plot is a tool for assessing whether or not a data set is approximately normally distributed. The residual plot of the spatial calibration model shows a straight line, because the data set is approximately normally distributed. The smaller normally distributed residuals represent better model performance compared to the nonspatial methods.

Spatial mapping of PM_2.5

In the nonspatial and spatial calibration models, the mapping RMSE values of PM_2.5 concentrations are 7.9 and 4.8 μg/m³, respectively. The spatial calibration model can improve the bias and decrease the mapping of RMSE to 39% compared to the nonspatial calibration model. Figure 9 shows the PM_2.5 estimation based on the nonspatial and spatial calibration models and regulatory stations. The hotspots from the low-cost sensors in the nonspatial calibration (Fig. 9a) are underestimated in the central and southwestern areas of Taiwan. After calibration, the spatial pattern of air pollution from low-cost sensors are similar to the results of the regulatory stations (Fig. 9b,c). The results (Fig. 9b,c) have similar large-scale patterns but vary in terms of details and spatial distributions in the local area. In addition, the spatial PM_2.5 pattern from sparse regulatory stations are smooth and ideal. The spatial PM_2.5 distribution from the low-cost sensors after calibration are complete and reasonable. Considering a large number of calibrated low-cost sensors, the spatial details can be shown in the estimation. The spatial pattern of air pollution showed that the air pollution levels of eastern and western Taiwan varied. The hotspots of air pollution are in the central and southwestern areas of Taiwan.

Discussion

Strict collocation becomes a challenge when the low-cost sensor and regulatory networks are already established². In this study, a nearest neighbor strategy was used by matching a low-cost sensor to its nearest TWEPA station. The overall average nearest neighbor distance of low-cost sensors is approximately 610 m, but the average nearest neighbor distance decreases to 311 m after excluding 7 data pairs with distances over 2 km. The high density of low-cost sensor network and the reasonability of this nearest neighbor strategy allowed for sufficient collocated samples to conduct the calibration.

Previous studies highlighted the association of the environmental conditions with the observation biases in low-cost sensors, as well as the calibration function that must be changed across space and time⁷. When used under environmental conditions with high relative humidity (> 80%), appropriate automated correction routines must be established to remove problematic observations from the datasets or transform these observations based on regulatory stations¹. Increased relative humidity is associated with a near-exponentially increased low-cost sensor data bias¹³. The observed influences of high humidity on low-cost sensor bias may be related to the issues in electronic circuits and the hygroscopic growth of fine particulates². In a time series, the PM_2.5 bias and relative humidity show positive association, while the relative humidity is high, particularly over 85%^1,7. In this case, 43% of sensors exist with the relative humidity over 85%, and the total of 56% sensors exist with the relative humidity over 80%. We easily faced this condition because the area had humid subtropical climate in the north and a tropical monsoon climate in the south of Taiwan. Therefore, low-cost sensors with high relative humidity must be established to eliminate the possibility of problematic observations. Multi-variable models may be fitted to obtain reasonable formulas, which can be easily interpreted and implemented from each sensor²². However, the current spatial calibration model performs the best (4.8 μg/m³), because the model is already considered a local weighting for the spatial heterogeneity of calibration. Not considering the environmental variables further may simplify the low-cost sensor data spatial calibration. When considering the relative humidity into the spatial calibration model, the model RMSE increases to 5.4 μg/m³. This result implies that the relative humidity shows a negligible association with the spatial calibration. The relative humidity is not the influencing factor in the spatial model, because the high relative humidity spreads everywhere at this time (average relative humidity is 82.1%).

Meanwhile, the calibration limitations and the risk of overfitting exist in small calibration datasets²³. In practice, an appropriate and simple choice is to convert the low-cost sensor data to the reference system. The flexibility of an empirical spatial (location-specific) adjustment resulted in the consistent performance. After effective calibration, distributed low-cost sensors can help understand the spatial details of air pollution. In addition, data calibration commonly requires the space–time homogeneity of environmental conditions; however, the suitability of homogeneous assumption could not easily be validated⁷. If the environment is with high relative humidity, the real-time calibration for low-cost sensors is necessary. The calibration requires the real-time referenced and low-cost sensor data. The proposed approach did not find the general calibration equation, but needed to calibrate spatially at any time. In practice, the low-cost sensor will be real-time adjusted. In this case, computational time loading of the model is low (< 5 min) in the computer with Intel Core i5-10210U. The associations among low-cost sensors can be automatically converted to the reference system. In this case, the frequency of the monitoring systems of low-cost and regulatory sensors maybe inconsistent. The low-cost sensor updated rate is up to 5 min but the regulatory station is hourly updated. Thus, the real- or near real-time operation can only be set as the hourly calibration, then the real-time mapping of PM_2.5 can be applied based on the current data with hourly-calibrated functions.

Low-cost sensors are cheap, compact, user-friendly, and provide high-resolution spatiotemporal air pollutant concentration data³. However, several types of low-cost sensors are also available for air quality monitoring in which the calibration may be applied diversely. In this spatial calibration model, the location-based and individual calibration coefficients at any time are considered.

This study offers an effective approach for exploring the low-cost sensor data transformation using the spatial calibration model, which is a local weighting method and is easy to fit between low-cost sensor and regulatory stations. The spatial calibration and mapping model provide additional details on the spatial variations of sensor transformation, which can be identified from the nonlinear and spatially varying relationship between the low-cost sensor and regulatory stations. Given that the observations of low-cost sensors play a critical role, the bias of the PM_2.5 measurements from raw low-cost sensors can induce the biases of the risk perception from the public⁷. Even though the bias can be significantly improved through spatial calibration, the residuals can still be present in the result. For example, Fig. 9a,b show the differences in the observations from the low-cost sensors using nonspatial and spatial calibration. Such system biases were significantly reduced by the present calibration. Although the biases cannot be adjusted to exact zeros, the overall residuals were significantly mitigated in the spatial calibration process. Spatial calibration and mapping are very important in managing public perceptions of air quality levels. Using the spatial calibration, the spatial patterns of air pollution from low-cost sensors are consistent with those obtained from regulatory stations with high relative humidity. After spatial calibration, the pollution hotspots determined by the low-cost sensors can provide the detailed spatial patterns of PM_2.5 better when compared to using regulatory stations. The pollution hotspots which were measured from the calibrated low-cost sensors happened in the central and southwestern areas of Taiwan.

Conclusions

The study conducted spatial calibrations, which collected measurements of and investigated the spatial varying relations of low-cost sensors and regulatory stations. This work also presented the derivation of the spatial calibration approach for low-cost sensors given that the observed low-cost and regulatory measurements exhibited inconsistency and spatial heterogeneity.

The proposed model considered spatial regression and IDW interpolation for calibrating and mapping regional air quality. The low-cost sensor data can be adjusted using spatial calibration, especially in the environment with high relative humidity. Results showed that the proposed approach can explain the variability of the biases from the low-cost sensors with an R-square value of 0.94. The mapping RMSE values of PM_2.5 concentrations decreased to 4.8 μg/m³. The spatial calibration model can reduce the bias and decrease the mapping of RMSE to 39% compared to nonspatial calibration. Moreover, the use of regional maps after spatial calibration—instead of regulatory stations—provided spatial information on air pollution. Spatial pattern of air pollution showed the reliable hotspots of air pollution in the central area and in the southwestern area of Taiwan. The proposed model provides planners with relevant information by which to understand air pollution after the spatial calibration of low-cost sensors based on regulatory observations. The spatial calibration and mapping model can effectively mitigate the misleading pollution risk perceptions from decreasing the biases of low-cost sensors in the environment with high relative humidity. In the future work, we will consider more spatial information, such as elevation into the kernel function. In addition, the model is available for both real-time calibration and offline calibration. The offline calibration will be applied based on long-term monitoring data.

Real-time visualization is available at https://sites.google.com/gs.ncku.edu.tw/ttest/%E9%A6%96%E9%A0%81

References

Liu, H.-Y., Schneider, P., Haugen, R. & Vogt, M. Performance assessment of a low-cost PM2.5 sensor for a near four-month period in Oslo, Norway. Atmosphere 10, 41 (2019).
Article ADS CAS Google Scholar
Bi, J., Wildani, A., Chang, H. H. & Liu, Y. Incorporating low-cost sensor measurements into high-resolution PM2.5 modeling at a large spatial scale. Environ. Sci. Technol. 54, 2152–2162 (2020).
Article ADS CAS Google Scholar
Munir, S., Mayfield, M., Coca, D., Jubb, S. A. & Osammor, O. Analysing the performance of low-cost air quality sensors, their drivers, relative benefits and calibration in cities—a case study in Sheffield. Environ. Monit. Assess. 191, 94 (2019).
Article Google Scholar
Lin, Y.-C., Chi, W.-J. & Lin, Y.-Q. The improvement of spatial-temporal resolution of PM2.5 estimation based on micro-air quality sensors by using data fusion technique. Environ. Int. 134, 105305 (2020).
Article CAS Google Scholar
Chen, L.-J. et al. An open framework for participatory PM2.5 monitoring in smart cities. IEEE Access 5, 14441–14454 (2017).
Article Google Scholar
Karagulian, F. et al. Review of the performance of low-cost sensors for air quality monitoring. Atmosphere 10, 506 (2019).
Article ADS CAS Google Scholar
Lee, C.-H., Wang, Y.-B. & Yu, H.-L. An efficient spatiotemporal data calibration approach for the low-cost PM25 sensing network: a case study in Taiwan. Environ. Int. 130, 104838 (2019).
Article CAS Google Scholar
Holstius, D. M., Pillarisetti, A., Smith, K. & Seto, E. Field calibrations of a low-cost aerosol sensor at a regulatory monitoring site in California. Atmos. Meas. Tech. 7, 1121–1131 (2014).
Article Google Scholar
Morawska, L. et al. Applications of low-cost sensing technologies for air quality monitoring and exposure assessment: how far have they gone?. Environ. Int. 116, 286–299 (2018).
Article CAS Google Scholar
Wang, Y. et al. Laboratory evaluation and calibration of three low-cost particle sensors for particulate matter measurement. Aerosol Sci. Technol. 49, 1063–1077 (2015).
Article ADS CAS Google Scholar
Johnson, K. K., Bergin, M. H., Russell, A. G. & Hagler, G. S. Field test of several low-cost particulate matter sensors in high and low concentration urban environments. Aerosol Air Qual. Res. 18, 565 (2018).
Article CAS Google Scholar
Jayaratne, R., Liu, X., Thai, P., Dunbabin, M. & Morawska, L. The influence of humidity on the performance of a low-cost air particle mass sensor and the effect of atmospheric fog. Atmos. Meas. Tech. 11, 4883–4890 (2018).
Article CAS Google Scholar
Crilley, L. R. et al. Effect of aerosol composition on the performance of low-cost optical particle counter correction factors. Atmos. Meas. Tech. 13, 1181–1193 (2020).
Article CAS Google Scholar
Williams, R. et al. Air Sensor Guidebook (US Environmental Protection Agency, Washington, DC, 2014).
Google Scholar
Wang, Y., Du, Y., Wang, J. & Li, T. Calibration of a low-cost PM2.5 monitor using a random forest model. Environ. Int. 133, 105161 (2019).
Article CAS Google Scholar
Magi, B. I., Cupini, C., Francis, J., Green, M. & Hauser, C. Evaluation of PM25 measured in an urban setting using a low-cost optical particle counter and a Federal equivalent method beta attenuation monitor. Aerosol Sci. Technol. 54, 147–159 (2020).
Article ADS CAS Google Scholar
Spinelle, L., Gerboles, M., Villani, M. G., Aleixandre, M. & Bonavitacola, F. Field calibration of a cluster of low-cost available sensors for air quality monitoring. Part A: Ozone and nitrogen dioxide. Sens. Actuators B: Chem. 215, 249–257 (2015).
Article CAS Google Scholar
Zimmerman, N. et al. A machine learning calibration model using random forests to improve sensor performance for lower-cost air quality monitoring. Atmos. Meas. Tech. 11, 291–313 (2018).
Article Google Scholar
Yuan, Z. An Note on Why Geographically Weighted Regression Overcomes Multidimensional-Kernel-Based Varying-Coefficient Model (2018). arXiv:1803.01402
Fotheringham, A. S., Brunsdon, C. & Charlton, M. Geographically Weighted Regression: The Analysis of Spatially Varying Relationships (Wiley, Hoboken, 2003).
MATH Google Scholar
Chu, H.-J. & Ali, M. Z. Establishment of regional concentration–duration–frequency relationships of air pollution: A case study for PM2.5. Int. J. Environ. Res. Public Health 17, 1419 (2020).
Article Google Scholar
Badura, M., Batog, P., Drzeniecka-Osiadacz, A. & Modzel, P. Regression methods in the calibration of low-cost sensors for ambient particulate matter measurements. SN Appl. Sci. 1, 622 (2019).
Article Google Scholar
Zusman, M. et al. Calibration of low-cost particulate matter sensors: model development for a multi-city epidemiological study. Environ. Int. 134, 105329 (2020).
Article Google Scholar

Download references

Acknowledgments

We are very grateful for the financial assistance from our Ministry of Science and Technology (108-2621-M-006-010-). The authors thank PM2.5 Open Data Portal and EPA's open data. Data are openly available in low-cost sensor and EPA data, Taiwan. Eventually, the authors would like to thank the editors and anonymous reviewers for providing suggestions of paper improvement. We also appreciate Prof. S. M. Chang, Dep. of Stat., NCKU for the check in model formulation.

Author information

Authors and Affiliations

National Cheng Kung University, Tainan City, Taiwan
Hone-Jay Chu, Muhammad Zeeshan Ali & Yu-Chen He

Authors

Hone-Jay Chu
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Zeeshan Ali
View author publications
You can also search for this author in PubMed Google Scholar
Yu-Chen He
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.-J.C. wrote the main manuscript and generated the model; M.Z.A. prepared the original figures; Y.-C.H. prepared the revised figures.

Corresponding author

Correspondence to Hone-Jay Chu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chu, HJ., Ali, M.Z. & He, YC. Spatial calibration and PM_2.5 mapping of low-cost air quality sensors. Sci Rep 10, 22079 (2020). https://doi.org/10.1038/s41598-020-79064-w

Download citation

Received: 15 July 2020
Accepted: 04 November 2020
Published: 16 December 2020
DOI: https://doi.org/10.1038/s41598-020-79064-w

This article is cited by

Probabilistic assessment of spatiotemporal fine particulate matter concentrations in Taiwan using multivariate indicator kriging
- Cheng-Shin Jang
Stochastic Environmental Research and Risk Assessment (2024)
Utilizing a Low-Cost Air Quality Sensor: Assessing Air Pollutant Concentrations and Risks Using Low-Cost Sensors in Selangor, Malaysia
- Zaki Khaslan
- Mohd Shahrul Mohd Nadzir
- Mylene G. Cayetano
Water, Air, & Soil Pollution (2024)
Performance-based protocol for selection of economical portable sensor for air quality measurement
- Nidhi Shukla
- Sunil Gulia
- S. K. Goyal
Environmental Monitoring and Assessment (2023)
Data Quality in IoT-Based Air Quality Monitoring Systems: a Systematic Mapping Study
- Julio Buelvas
- Danny Múnera
- Natalia Gaviria
Water, Air, & Soil Pollution (2023)
Modeling fine-grained spatio-temporal pollution maps with low-cost sensors
- Shiva R. Iyer
- Ananth Balashankar
- Lakshminarayanan Subramanian
npj Climate and Atmospheric Science (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.