scispace - formally typeset
Search or ask a question
Posted Content

A Gap Analysis of Low-Cost Outdoor Air Quality Sensor In-Field Calibration

TL;DR: This article presents low-cost sensor technologies, and it survey and assess machine learning-based calibration techniques for their calibration, and presents open questions and directions for future research.
Abstract: In recent years, interest in monitoring air quality has been growing. Traditional environmental monitoring stations are very expensive, both to acquire and to maintain, therefore their deployment is generally very sparse. This is a problem when trying to generate air quality maps with a fine spatial resolution. Given the general interest in air quality monitoring, low-cost air quality sensors have become an active area of research and development. Low-cost air quality sensors can be deployed at a finer level of granularity than traditional monitoring stations. Furthermore, they can be portable and mobile. Low-cost air quality sensors, however, present some challenges: they suffer from crosssensitivities between different ambient pollutants; they can be affected by external factors such as traffic, weather changes, and human behavior; and their accuracy degrades over time. Some promising machine learning approaches can help us obtain highly accurate measurements with low-cost air quality sensors. In this article, we present low-cost sensor technologies, and we survey and assess machine learning-based calibration techniques for their calibration. We conclude by presenting open questions and directions for future research.
Citations
More filters
01 Jul 2018
TL;DR: In this article, the authors conducted a comprehensive literature search including both the scientific and grey literature, and concluded that there is no clear answer to the question, due to a lack of: sensor/monitor manufacturers' quantitative specifications of performance, consensus regarding recommended end-use and associated minimal performance targets of these technologies, and the ability of the prospective users to formulate the requirements for their applications, or conditions of the intended use.
Abstract: Over the past decade, a range of sensor technologies became available on the market, enabling a revolutionary shift in air pollution monitoring and assessment. With their cost of up to three orders of magnitude lower than standard/reference instruments, many avenues for applications have opened up. In particular, broader participation in air quality discussion and utilisation of information on air pollution by communities has become possible. However, many questions have been also asked about the actual benefits of these technologies. To address this issue, we conducted a comprehensive literature search including both the scientific and grey literature. We focused upon two questions: (1) Are these technologies fit for the various purposes envisaged? and (2) How far have these technologies and their applications progressed to provide answers and solutions? Regarding the former, we concluded that there is no clear answer to the question, due to a lack of: sensor/monitor manufacturers' quantitative specifications of performance, consensus regarding recommended end-use and associated minimal performance targets of these technologies, and the ability of the prospective users to formulate the requirements for their applications, or conditions of the intended use. Numerous studies have assessed and reported sensor/monitor performance under a range of specific conditions, and in many cases the performance was concluded to be satisfactory. The specific use cases for sensors/monitors included outdoor in a stationary mode, outdoor in a mobile mode, indoor environments and personal monitoring. Under certain conditions of application, project goals, and monitoring environments, some sensors/monitors were fit for a specific purpose. Based on analysis of 17 large projects, which reached applied outcome stage, and typically conducted by consortia of organizations, we observed that a sizable fraction of them (~ 30%) were commercial and/or crowd-funded. This fact by itself signals a paradigm change in air quality monitoring, which previously had been primarily implemented by government organizations. An additional paradigm-shift indicator is the growing use of machine learning or other advanced data processing approaches to improve sensor/monitor agreement with reference monitors. There is still some way to go in enhancing application of the technologies for source apportionment, which is of particular necessity and urgency in developing countries. Also, there has been somewhat less progress in wide-scale monitoring of personal exposures. However, it can be argued that with a significant future expansion of monitoring networks, including indoor environments, there may be less need for wearable or portable sensors/monitors to assess personal exposure. Traditional personal monitoring would still be valuable where spatial variability of pollutants of interest is at a finer resolution than the monitoring network can resolve.

138 citations

Journal ArticleDOI
TL;DR: In this paper, low-cost air quality sensors (LCAQS) can be deployed in dense monitoring networks to provide timely and comprehensive snapshots of pollutant concentrations and their spatial and temporal variability at various scales with relatively less cost and labor.
Abstract: As a potential complement to traditional regulatory instruments, low-cost air quality sensors (LCAQS) can be deployed in dense monitoring networks to provide timely and comprehensive snapshots of pollutant concentrations and their spatial and temporal variability at various scales with relatively less cost and labor. However, a lack of practical guidance and a limited understanding of sensor data quality hinder the widespread application of this emerging technology. We leveraged air quality data collected from state and local monitoring agencies in metropolitan areas of the United States to evaluate how low-cost sensors could be deployed across the U.S. We found that ozone, as a secondary pollutant, is more homogeneous than other pollutants at various scales. PM2.5, CO, and NO2 displayed homogeneities that varied by city, making it challenging to design a uniform network that was suitable across geographies. Our low-cost sensor data in New York City indicated that PM2.5 sensors track well with light-scattering reference methods, particularly at low concentrations. The same phenomenon was also found after thoroughly evaluating sensor evaluation reports from the Air Quality Sensor Performance Evaluation Center (AQ-SPEC). Furthermore, LCAQS data collected during wildfire episodes in Portland, OR show that a real-time (i.e. in situ) machine learning calibration process is a promising approach to address the data quality challenges persisting in LCAQS applications. Our research highlights the urgency and importance of practical guidance for deploying LCAQS.

7 citations

References
More filters
Journal ArticleDOI
TL;DR: A general gradient descent boosting paradigm is developed for additive expansions based on any fitting criterion, and specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification.
Abstract: Function estimation/approximation is viewed from the perspective of numerical optimization in function space, rather than parameter space. A connection is made between stagewise additive expansions and steepest-descent minimization. A general gradient descent “boosting” paradigm is developed for additive expansions based on any fitting criterion.Specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification. Special enhancements are derived for the particular case where the individual additive components are regression trees, and tools for interpreting such “TreeBoost” models are presented. Gradient boosting of regression trees produces competitive, highly robust, interpretable procedures for both regression and classification, especially appropriate for mining less than clean data. Connections between this approach and the boosting methods of Freund and Shapire and Friedman, Hastie and Tibshirani are discussed.

17,764 citations


"A Gap Analysis of Low-Cost Outdoor ..." refers background in this paper

  • ...Example of boosting algorithms are adaptive boosting (AdaBoost) [118], gradient boosting (GB) [119], and extreme gradient boosting (XGBoost) [120]....

    [...]

Proceedings ArticleDOI
13 Aug 2016
TL;DR: XGBoost as discussed by the authors proposes a sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning to achieve state-of-the-art results on many machine learning challenges.
Abstract: Tree boosting is a highly effective and widely used machine learning method. In this paper, we describe a scalable end-to-end tree boosting system called XGBoost, which is used widely by data scientists to achieve state-of-the-art results on many machine learning challenges. We propose a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning. More importantly, we provide insights on cache access patterns, data compression and sharding to build a scalable tree boosting system. By combining these insights, XGBoost scales beyond billions of examples using far fewer resources than existing systems.

14,872 citations

01 Jan 1996

7,386 citations

Journal ArticleDOI
TL;DR: It is the opinion of the writing group that the overall evidence is consistent with a causal relationship between PM2.5 exposure and cardiovascular morbidity and mortality.
Abstract: In 2004, the first American Heart Association scientific statement on “Air Pollution and Cardiovascular Disease” concluded that exposure to particulate matter (PM) air pollution contributes to card...

5,227 citations


"A Gap Analysis of Low-Cost Outdoor ..." refers background in this paper

  • ...Besides having a direct effect on mortality, air pollution is strongly associated with a broad spectrum of acute and chronic diseases, including cardiovascular diseases [2, 3, 4], lung diseases [5, 6, 7], several types of cancer [3, 8, 9], and even conditions ar X iv :1 91 2....

    [...]

Journal ArticleDOI
01 Mar 1996
TL;DR: The article discusses the motivations behind the development of ANNs and describes the basic biological neuron and the artificial computational model, and outlines network architectures and learning processes, and presents some of the most commonly used ANN models.
Abstract: Artificial neural nets (ANNs) are massively parallel systems with large numbers of interconnected simple processors. The article discusses the motivations behind the development of ANNs and describes the basic biological neuron and the artificial computational model. It outlines network architectures and learning processes, and presents some of the most commonly used ANN models. It concludes with character recognition, a successful ANN application.

4,281 citations


"A Gap Analysis of Low-Cost Outdoor ..." refers methods in this paper

  • ...ANNs have been initially motivated by the structure of the human brain [122] and are composed of a set of nodes called neurons, that are grouped into layers....

    [...]