scispace - formally typeset
Open AccessJournal ArticleDOI

A Machine Learning Approach to Predict Air Quality in California

TLDR
This paper employs a popular machine learning method, support vector regression (SVR), to forecast pollutant and particulate levels and to predict the air quality index (AQI), and demonstrates that SVR with RBF kernel allows us to accurately predict hourly pollutant concentrations.
Abstract
Predicting air quality is a complex task due to the dynamic nature, volatility, and high variability in time and space of pollutants and particulates. At the same time, being able to model, predict, and monitor air quality is becoming more and more relevant, especially in urban areas, due to the observed critical impact of air pollution on citizens’ health and the environment. In this paper, we employ a popular machine learning method, support vector regression (SVR), to forecast pollutant and particulate levels and to predict the air quality index (AQI). Among the various tested alternatives, radial basis function (RBF) was the type of kernel that allowed SVR to obtain the most accurate predictions. Using the whole set of available variables revealed a more successful strategy than selecting features using principal component analysis. The presented results demonstrate that SVR with RBF kernel allows us to accurately predict hourly pollutant concentrations, like carbon monoxide, sulfur dioxide, nitrogen dioxide, ground-level ozone, and particulate matter 2.5, as well as the hourly AQI for the state of California. Classification into six AQI categories defined by the US Environmental Protection Agency was performed with an accuracy of 94.1% on unseen validation data.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Machine learning

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Journal ArticleDOI

Development of air quality monitoring (AQM) models using different machine learning approaches

TL;DR: In this article, Machine Learning(ML) Methods such as Non-Linear Artificial Neural Network(ANN), Statistical Multilevel Regression, Neuro-Fuzzy and Deep Learning Long-Short-Term Memory (DL-LSTM) are used to find the current concentration level of pollutants and will be useful for Real Time Correction (RTC) to give a feedback that can be used to reduce the contaminants in air for further days.
Journal ArticleDOI

Air pollution prediction with machine learning: a case study of Indian cities

TL;DR: In this article , the authors investigated six years of air pollution data from 23 Indian cities for air quality analysis and prediction, and five machine learning models were employed to predict air quality.
Journal ArticleDOI

Prediction of air pollutants on roadside of the elevated roads with combination of pollutants periodicity and deep learning method

TL;DR: Wang et al. as discussed by the authors quantified the environmental impacts of the elevated road location on four traffic pollutants (e.g., NO, NO2, CO, and O3) in Shanghai, using the two-year observation data from Shanghai roadside air quality monitoring stations.
References
More filters
Journal ArticleDOI

Support-Vector Networks

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.
Journal ArticleDOI

Machine learning

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Proceedings Article

Support Vector Regression Machines

TL;DR: This work compares support vector regression (SVR) with a committee regression technique (bagging) based on regression trees and ridge regression done in feature space and expects that SVR will have advantages in high dimensionality space because SVR optimization does not depend on the dimensionality of the input space.
Journal ArticleDOI

Distribution of Residual Autocorrelations in Autoregressive-Integrated Moving Average Time Series Models

TL;DR: In this paper, it is shown that the residual autocorrelations are to a close approximation representable as a singular linear transformation of the auto-correlations of the errors so that they possess a singular normal distribution.
Journal ArticleDOI

Support Vector Machines for classification and regression

TL;DR: The increasing interest in Support Vector Machines (SVMs) over the past 15 years is described, including its application to multivariate calibration, and why it is useful when there are outliers and non-linearities.
Related Papers (5)