scispace - formally typeset
Search or ask a question
Journal ArticleDOI

A novel bagging ensemble approach for predicting summertime ground-level ozone concentration.

01 Feb 2019-Journal of The Air & Waste Management Association (Taylor & Francis)-Vol. 69, Iss: 2, pp 220-233
TL;DR: The feasibility of using ensemble model with seven meteorological parameters as input variables to predict the surface level O3 concentration andagged random forest predicted the ground level ozone better with higher Nash-Sutcliffe coefficient than conventional models.
Abstract: Ozone pollution appears as a major air quality issue, e.g. for the protection of human health and vegetation. Formation of ground level ozone is a complex photochemical phenomenon and involves numerous intricate factors most of which are interrelated with each other. Machine learning techniques can be adopted to predict the ground level ozone. The main objective of the present study is to develop the state-of-the-art ensemble bagging approach to model the summer time ground level ozone in an industrial area comprising a hazardous waste management facility. In this study, the feasibility of using ensemble model with seven meteorological parameters as input variables to predict the surface level O3 concentration. Multilayer perceptron, RTree, REPTree, and Random forest were employed as the base learners. The error measures used for checking the performance of each model includes IoAd, R2, and PEP. The model results were validated against an independent test data set. Bagged random forest predicted the ground level ozone better with higher Nash-Sutcliffe coefficient 0.93. This study scaffolded the current research gap in big data analysis identified with air pollutant prediction. Implications: The main focus of this paper is to model the summer time ground level O3 concentration in an Industrial area comprising of hazardous waste management facility. Comparison study was made between the base classifiers and the ensemble classifiers. Most of the conventional models can well predict the average concentrations. In this case the peak concentrations are of importance as it has serious effect on human health and environment. The models developed should also be homoscedastic.
Citations
More filters
Journal ArticleDOI
TL;DR: In this article, the performance of photocatalysts is evaluated in terms of quantum yield, space-time yield, and other operational variables, including mode of operation, irradiation time, and relative humidity.
Abstract: Hydrogen sulfide (H2S) is regarded as a broad-spectrum poison associated with severe health consequences. Among the available treatment options, photocatalytic technology may be effectively applied to the production of hydrogen gas through the splitting of H2S molecules and the addition of 79.9 kJ mol−1 of energy. As a result, advanced photo-reactive media may provide a win-win strategy to treat the parent pollutant (H2S) while producing hydrogen gas. This review encompasses both TiO2 and non-TiO2 catalysts capable of operating under ultraviolet, visible, and solar light irradiation. The performances of photocatalysts are assessed in terms of quantum yield, space-time yield, and other operational variables, including mode of operation, irradiation time, and relative humidity. The concept of space velocity is used to compare photocatalysts in reference to benchmark parameters for the treatment of H2S. This review addresses current limitations and future prospects of the application of photocatalytic technology to efficiently mitigate H2S pollution.

68 citations

Journal ArticleDOI
TL;DR: This study critically investigates, analyses, and summarizes the existing soft computing modeling approaches in air quality modeling, and reviews and discusses artificial neural network (ANN), support vector machine (SVM), evolutionary ANN and SVM, the fuzzy logic model, neuro-fuzzy systems, the deep learning model, ensemble, and other hybrid models.
Abstract: Air quality models simulate the atmospheric environment systems and provide increased domain knowledge and reliable forecasting. They provide early warnings to the population and reduce the number of measuring stations. Due to the complexity and non-linear behavior associated with air quality data, soft computing models became popular in air quality modeling (AQM). This study critically investigates, analyses, and summarizes the existing soft computing modeling approaches. Among the many soft computing techniques in AQM, this article reviews and discusses artificial neural network (ANN), support vector machine (SVM), evolutionary ANN and SVM, the fuzzy logic model, neuro-fuzzy systems, the deep learning model, ensemble, and other hybrid models. Besides, it sheds light on employed input variables, data processing approaches, and targeted objective functions during modeling. It was observed that many advanced, reliable, and self-organized soft computing models like functional network, genetic programming, type-2 fuzzy logic, genetic fuzzy, genetic neuro-fuzzy, and case-based reasoning are rarely explored in AQM. Therefore, the partially explored and unexplored soft computing techniques can be appropriate choices for research in the field of air quality modeling. The discussion in this paper will help to determine the suitability and appropriateness of a particular model for a specific modeling context.

20 citations

Journal ArticleDOI
TL;DR: In this paper , a general framework for creating ensembles in the context of classification is proposed, which consists of four stages: objectives, data preparing, model training, and model testing.

7 citations

Journal ArticleDOI
TL;DR: In this paper , the authors conducted a literature review on full-scale big data in forecasting research and provided an overall review of big data-based forecasting research, details what, where and how big data improved prediction, and offers insights into future prospects.

6 citations

Journal ArticleDOI
TL;DR: In this article, the authors conduct a literature review on full-scale big data analytics in forecasting research and provide insights into future prospects, including where forecasting hotspots and analysis and forecasting methods used.

6 citations

References
More filters
Journal ArticleDOI
TL;DR: In this article, the principles governing the application of the conceptual model technique to river flow forecasting are discussed and the necessity for a systematic approach to the development and testing of the model is explained and some preliminary ideas suggested.

19,601 citations


"A novel bagging ensemble approach f..." refers background in this paper

  • ...R2 is calculated since it is sensitive to the differences in observed and modeled means and variances (Nash and Sutcliffe 1970)....

    [...]

Book ChapterDOI
21 Jun 2000
TL;DR: Some previous studies comparing ensemble methods are reviewed, and some new experiments are presented to uncover the reasons that Adaboost does not overfit rapidly.
Abstract: Ensemble methods are learning algorithms that construct a set of classifiers and then classify new data points by taking a (weighted) vote of their predictions. The original ensemble method is Bayesian averaging, but more recent algorithms include error-correcting output coding, Bagging, and boosting. This paper reviews these methods and explains why ensembles can often perform better than any single classifier. Some previous studies comparing ensemble methods are reviewed, and some new experiments are presented to uncover the reasons that Adaboost does not overfit rapidly.

5,679 citations


"A novel bagging ensemble approach f..." refers methods in this paper

  • ...Ensemble methods have the advantage of reducing these key shortcomings of standard learning algorithms (Dietterich 2000)....

    [...]

Journal ArticleDOI
TL;DR: It is shown that both the approximation accuracy and execution speed of gradient boosting can be substantially improved by incorporating randomization into the procedure.

5,355 citations


"A novel bagging ensemble approach f..." refers methods in this paper

  • ...Bagging decreases the residual error between the observed and predicted values by creating bootstrapped replica data sets (Friedman 2002)....

    [...]

Journal ArticleDOI
TL;DR: In this paper, it is suggested that the correlation coefficieness between observed and simulated variates is not as good as observed variates, and that correlation can be improved.
Abstract: Traditional methods of evaluating geographic models by statistical comparisons between observed and simulated variates are criticized. In particular, it is suggested that the correlation coefficien...

3,761 citations


"A novel bagging ensemble approach f..." refers background in this paper

  • ...It is calculated as the ratio between the mean squared error and the potential error (Willmott 1981)....

    [...]

Book
24 Nov 1999
TL;DR: A detailed overview of the chemistry of Polluted and Remote Atmospheres can be found in this paper, where the OZIPR model is used to simulate the formation of gases and particles in the Troposphere.
Abstract: Overview of the Chemistry of Polluted and Remote Atmospheres. The Atmospheric System. Spectroscopy and Photochemistry: Fundamentals. Photochemistry of Important Atmospheric Species. Kinetics and Atmospheric Chemistry. Rates and Mechanisms of Gas-Phase Reactions in Irradiated Organic-NOx-Air Mixtures. Chemistry of Inorganic Nitrogen Compounds. Acid Deposition: Formation and Fates of Inorganic and Organic Acids in the Troposphere. Particles in the Troposphere. Airborne Polycyclic Aromatic Hydrocarbons and Their Derivatives: Atmospheric Chemistry and Toxicological Implications. Analytical Methods and Typical Atmospheric Concentrations for Gases and Particles. Homogeneous and Heterogeneous Chemistry in the Stratosphere. Scientific Basis for Control of Halogenated Organics. Global Tropospheric Chemistry and Climate Change. Indoor Air Pollution: Sources, Levels, Chemistry, and Fates. Applications of Atmospheric Chemistry: Air Pollution Control Strategies and Risk Assessments for Tropospheric Ozone and Associated Photochemical Oxidants, Acids, Particles, and Hazardous Air Pollutants. Appendix I: Enthalpies of Formation of Some Gaseous Molecules, Atoms, and Free Radicals at 298 K. Appendix II: Bond Dissociation Energies. Appendix III: Running the OZIPR Model. Appendix IV: Some Relevant Web Sites. Appendix V: Pressures and Temperatures for Standard Atmosphere. Appendix VI: Answers to Selected Problems. Subject Index.

2,051 citations