Open Access Journal Article

A systematic review of data mining and machine learning for air pollution epidemiology.

TLDR
This work shows that data mining is increasingly being applied in air pollution epidemiology, and identifies deep learning and geospatial pattern mining as two burgeoning areas of data mining with good potential for future applications in air pollution epidemiology.
Abstract
Data measuring airborne pollutants, public health and environmental factors are increasingly being stored and merged. These big datasets offer great potential, but also challenge traditional epidemiological methods. This has motivated the exploration of alternative methods to make predictions, find patterns and extract information. To this end, data mining and machine learning algorithms are increasingly being applied to air pollution epidemiology.
We conducted a systematic literature review on the application of data mining and machine learning methods in air pollution epidemiology. We carried out our search process in PubMed, the MEDLINE database and Google Scholar. Research articles applying data mining and machine learning methods to air pollution epidemiology were queried and reviewed.
Our search queries returned 400 research articles. Applying our inclusion/exclusion criteria in a fine-grained analysis reduced these to 47 articles, which we separated into three primary areas of interest: 1) source apportionment; 2) forecasting/prediction of air pollution/quality or exposure; and 3) generating hypotheses. Early applications had a preference for artificial neural networks. In more recent work, decision trees, support vector machines, k-means clustering and the APRIORI algorithm have been widely applied. Our survey shows that the majority of the research has been conducted in Europe, China and the USA, and that data mining is becoming an increasingly common tool in environmental health. For potential new directions, we have identified deep learning and geospatial pattern mining as two burgeoning areas of data mining that have good potential for future applications in air pollution epidemiology.
We carried out a systematic review identifying the current trends, challenges and new directions to explore in the application of data mining methods to air pollution epidemiology. This work shows that data mining is increasingly being applied in air pollution epidemiology. The potential to support air pollution epidemiology continues to grow with advancements in data mining related to temporal and geospatial mining, and deep learning. This is further supported by new sensors and storage media that enable larger, higher-quality datasets. This suggests that many more fruitful applications can be expected in the future.

