A Machine Learning Approach to Predict Air Quality in California
Mauro Castelli,Fabiana Martins Clemente,Aleš Popovič,Aleš Popovič,Sara Silva,Leonardo Vanneschi,Leonardo Vanneschi +6 more
TLDR
This paper employs a popular machine learning method, support vector regression (SVR), to forecast pollutant and particulate levels and to predict the air quality index (AQI), and demonstrates that SVR with RBF kernel allows us to accurately predict hourly pollutant concentrations.Abstract:
Predicting air quality is a complex task due to the dynamic nature, volatility, and high variability in time and space of pollutants and particulates. At the same time, being able to model, predict, and monitor air quality is becoming more and more relevant, especially in urban areas, due to the observed critical impact of air pollution on citizens’ health and the environment. In this paper, we employ a popular machine learning method, support vector regression (SVR), to forecast pollutant and particulate levels and to predict the air quality index (AQI). Among the various tested alternatives, radial basis function (RBF) was the type of kernel that allowed SVR to obtain the most accurate predictions. Using the whole set of available variables revealed a more successful strategy than selecting features using principal component analysis. The presented results demonstrate that SVR with RBF kernel allows us to accurately predict hourly pollutant concentrations, like carbon monoxide, sulfur dioxide, nitrogen dioxide, ground-level ozone, and particulate matter 2.5, as well as the hourly AQI for the state of California. Classification into six AQI categories defined by the US Environmental Protection Agency was performed with an accuracy of 94.1% on unseen validation data.read more
Citations
More filters
Journal ArticleDOI
Machine learning
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Journal ArticleDOI
Development of air quality monitoring (AQM) models using different machine learning approaches
TL;DR: In this article, Machine Learning(ML) Methods such as Non-Linear Artificial Neural Network(ANN), Statistical Multilevel Regression, Neuro-Fuzzy and Deep Learning Long-Short-Term Memory (DL-LSTM) are used to find the current concentration level of pollutants and will be useful for Real Time Correction (RTC) to give a feedback that can be used to reduce the contaminants in air for further days.
Journal ArticleDOI
Air pollution prediction with machine learning: a case study of Indian cities
K. Saravana Kumar,B. P. Pande +1 more
TL;DR: In this article , the authors investigated six years of air pollution data from 23 Indian cities for air quality analysis and prediction, and five machine learning models were employed to predict air quality.
Journal ArticleDOI
America??s Children and the Environment: Children??s Environmental Health Indicators for the US
T Woodruff,A Kyle,A Daniel +2 more
Journal ArticleDOI
Prediction of air pollutants on roadside of the elevated roads with combination of pollutants periodicity and deep learning method
TL;DR: Wang et al. as discussed by the authors quantified the environmental impacts of the elevated road location on four traffic pollutants (e.g., NO, NO2, CO, and O3) in Shanghai, using the two-year observation data from Shanghai roadside air quality monitoring stations.
References
More filters
Journal ArticleDOI
Support-Vector Networks
Corinna Cortes,Vladimir Vapnik +1 more
TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.
Journal ArticleDOI
Machine learning
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Proceedings Article
Support Vector Regression Machines
TL;DR: This work compares support vector regression (SVR) with a committee regression technique (bagging) based on regression trees and ridge regression done in feature space and expects that SVR will have advantages in high dimensionality space because SVR optimization does not depend on the dimensionality of the input space.
Journal ArticleDOI
Distribution of Residual Autocorrelations in Autoregressive-Integrated Moving Average Time Series Models
George E. P. Box,David A. Pierce +1 more
TL;DR: In this paper, it is shown that the residual autocorrelations are to a close approximation representable as a singular linear transformation of the auto-correlations of the errors so that they possess a singular normal distribution.
Journal ArticleDOI
Support Vector Machines for classification and regression
TL;DR: The increasing interest in Support Vector Machines (SVMs) over the past 15 years is described, including its application to multivariate calibration, and why it is useful when there are outliers and non-linearities.
Related Papers (5)
Application of nonlinear land use regression models for ambient air pollutants and air quality index
A new air quality monitoring and early warning system: Air quality assessment and air pollutant concentration prediction.
Zhongshan Yang,Jian Wang +1 more
Enhanced transport-related air pollution prediction through a novel metamodel approach
Mario Catalano,Fabio Galatioto +1 more