scispace - formally typeset
Open AccessJournal ArticleDOI

Proposition of New Ensemble Data-Intelligence Models for Surface Water Quality Prediction

TLDR
In this paper, five different ensemble machine learning (ML) models including Quantile regression forest (QRF), Random Forest (RF), radial support vector machine (SVM), Stochastic Gradient Boosting (GBM).
Abstract
An accurate prediction of water quality (WQ) related parameters is considered as pivotal decisive tool in sustainable water resources management. In this study, five different ensemble machine learning (ML) models including Quantile regression forest (QRF), Random Forest (RF), radial support vector machine (SVM), Stochastic Gradient Boosting (GBM) and Gradient Boosting Machines (GBM_H2O) were developed to predict the monthly biochemical oxygen demand (BOD) values of the Euphrates River, Iraq. For this aim, monthly average data of water temperature (T), Turbidity, pH, Electrical Conductivity (EC), Alkalinity (Alk), Calcium (Ca), chemical oxygen demand (COD), Sulfate (SO4), total dissolved solids (TDS), total suspended solids (TSS), and BOD measured for ten years period were used in this study. The performances of these standalone models were compared with integrative models developed by coupling the applied ML models with two different feature extraction algorithms i.e., Genetic Algorithm (GA) and Principal Components Analysis (PCA). The reliability of the applied models was evaluated based on the statistical performance criteria of determination coefficient (R2), root mean square error (RMSE), mean absolute error (MAE), Nash-Sutcliffe model efficiency coefficient (NSE), Willmott index (d), and percent bias (PBIAS). Results showed that among the developed models, QRF model attained the superior performance. The performance of the evaluated models presented in this study proved that the developed integrative PCA-QRF model presented much better performance compared with the standalone ones and with those integrated with GA. The statistical criteria of R2, RMSE, MAE, NSE, d, and PBIAS of PCA-QRF were 0.94, 0.12, 0.05, 0.93, 0.98, and 0.3, respectively.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Variational mode decomposition based random forest model for solar radiation forecasting: New emerging machine learning technology

TL;DR: In this article, a variational mode decomposition method is integrated with simulated annealing and random forest (VMD-SA-RF) for resolving the nonlinearity and nonstationarity challenges caused by meteorological variables in forecasting Radn.
Journal ArticleDOI

The assessment of emerging data-intelligence technologies for modeling Mg+2 and SO4-2 surface water quality.

TL;DR: In this article, two wavelet-complementary intelligence paradigms, wavelet least square support vector machine coupled with improved simulated annealing (W-LSSVM-ISA) and the wavelet extended Kalman filter integrated with artificial neural network, were used for accurate forecasting of the monthly), magnesium (Mg+2), and sulfate (SO4−2) indices at Maroon River, in Southwest of Iran.
Journal ArticleDOI

New generation neurocomputing learning coupled with a hybrid neuro-fuzzy model for quantifying water quality index variable: A case study from Saudi Arabia

TL;DR: In this paper , an uncertain water quality index (WQI) has been quantified to monitor water resource quality and management, and six different computational models WQI, namely: Generalized regression neural network (GRNN), Elman Neural Network (Elm NN), Feed Forward Neural Network(FFNN), Support Vector Machine (SVM), Linear Regression (LR), and Neuro-Fuzzy (NF).
Journal ArticleDOI

Coupled online sequential extreme learning machine model with ant colony optimization algorithm for wheat yield prediction

TL;DR: In this article , a two-phase universal machine learning (ML) model was designed to predict wheat yield (Wpred), utilizing 27 agricultural counties' data within the Agro-ecological zone.
Journal ArticleDOI

Performances of MLR, RBF-NN, and MLP-NN in the evaluation and prediction of water resources quality for irrigation purposes under two modeling scenarios

TL;DR: In this article , multiple linear regression (MLR), RBF-NN and multilayer perceptron neural network (MLP-NN) models were developed for the monitoring and management of irrigation water quality (IWQ) in Ojoto area, southeastern Nigeria.
References
More filters
Journal ArticleDOI

Random Forests

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Book

The Nature of Statistical Learning Theory

TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?
Journal ArticleDOI

Greedy function approximation: A gradient boosting machine.

TL;DR: A general gradient descent boosting paradigm is developed for additive expansions based on any fitting criterion, and specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification.
Book

Genetic Algorithms

Journal ArticleDOI

An overview of statistical learning theory

TL;DR: How the abstract learning theory established conditions for generalization which are more general than those discussed in classical statistical paradigms are demonstrated and how the understanding of these conditions inspired new algorithmic approaches to function estimation problems are demonstrated.
Related Papers (5)