scispace - formally typeset
Open AccessJournal ArticleDOI

GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks

Reads0
Chats0
TLDR
In this paper, the authors proposed and verified a novel ensemble methodology that could improve prediction performance of landslide susceptibility models based on the functional tree classifier and three current state-of-the art machine learning ensemble frameworks, Bagging, AdaBoost, and MultiBoost.
Abstract
The main objective of this study is to propose and verify a novel ensemble methodology that could improve prediction performances of landslide susceptibility models. The proposed methodology is based on the functional tree classifier and three current state-of-the art machine learning ensemble frameworks, Bagging, AdaBoost, and MultiBoost. According to current literature, these methods have been rarely used for the modeling of rainfall-induced landslides. The corridor of the National Road 32 (Vietnam) was selected as a case study. In the first stage, the landslide inventory map with 262 landslide polygons that occurred during the last 20 years was constructed and then was randomly partitioned into a ratio of 70/30 for training and validating the models. Second, ten landslide conditioning factors were prepared such as slope, aspect, relief amplitude, topographic wetness index, topographic shape, distance to roads, distance to rivers, distance to faults, lithology, and rainfall. The model performance was assessed and compared using the receiver operating characteristic and statistical evaluation measures. Overall, the FT with Bagging model has the highest prediction capability (AUC = 0.917), followed by the FT with MultiBoost model (AUC = 0.910), the FT model (AUC = 0.898), and the FT with AdaBoost model (AUC = 0.882). Compared with those derived from popular methods such as J48 decision trees and artificial neural networks, the performance of the FT with Bagging model is better. Therefore, it can be concluded that the FT with Bagging is promising and could be used as an alternative in landslide susceptibility assessment. The result in this study is useful for land use planning and decision making in landslide prone areas.

read more

Content maybe subject to copyright    Report

GIS-based modeling of rainfall-induced landslides using data mining-based functional
trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks
ABSTRACT
The main objective of this study is to propose and verify a novel ensemble methodology that
could improve prediction performances of landslide susceptibility models. The proposed
methodology is based on the functional tree classifier and three current state-of-the art
machine learning ensemble frameworks, Bagging, AdaBoost, and MultiBoost. According to
current literature, these methods have been rarely used for the modeling of rainfall-induced
landslides. The corridor of the National Road 32 (Vietnam) was selected as a case study. In
the first stage, the landslide inventory map with 262 landslide polygons that occurred during
the last 20 years was constructed and then was randomly partitioned into a ratio of 70/30 for
training and validating the models. Second, ten landslide conditioning factors were prepared
such as slope, aspect, relief amplitude, topographic wetness index, topographic shape,
distance to roads, distance to rivers, distance to faults, lithology, and rainfall. The model
performance was assessed and compared using the receiver operating characteristic and
statistical evaluation measures. Overall, the FT with Bagging model has the highest
prediction capability (AUC = 0.917), followed by the FT with MultiBoost model
(AUC = 0.910), the FT model (AUC = 0.898), and the FT with AdaBoost model
(AUC = 0.882). Compared with those derived from popular methods such as J48 decision
trees and artificial neural networks, the performance of the FT with Bagging model is better.
Therefore, it can be concluded that the FT with Bagging is promising and could be used as an
alternative in landslide susceptibility assessment. The result in this study is useful for land
use planning and decision making in landslide prone areas.
Keyword: Landslide; GIS; Functional trees; AdaBoost; MultiBoost; Bagging; Vietnam
Citations
More filters
Journal ArticleDOI

Performance evaluation of the GIS-based data mining techniques of best-first decision tree, random forest, and naïve Bayes tree for landslide susceptibility modeling

TL;DR: The main aim of the present study is to explore and compare three state-of-the art data mining techniques, best-first decision tree, random forest, and naïve Bayes tree, for landslide susceptibility assessment in the Longhai area of China.
Journal ArticleDOI

Comparison of convolutional neural networks for landslide susceptibility mapping in Yanshan County, China.

TL;DR: The experimental results demonstrated that the proportions of highly susceptible zones in all of the CNN landslide susceptibility maps are highly similar and lower than 30%, which indicates that these CNNs are more practical for landslide prevention and management than conventional methods.
References
More filters
Journal ArticleDOI

The measurement of observer agreement for categorical data

TL;DR: A general statistical methodology for the analysis of multivariate categorical data arising from observer reliability studies is presented and tests for interobserver bias are presented in terms of first-order marginal homogeneity and measures of interob server agreement are developed as generalized kappa-type statistics.
Journal ArticleDOI

A Coefficient of agreement for nominal Scales

TL;DR: In this article, the authors present a procedure for having two or more judges independently categorize a sample of units and determine the degree, significance, and significance of the units. But they do not discuss the extent to which these judgments are reproducible, i.e., reliable.
Book

C4.5: Programs for Machine Learning

TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and over hitting.
Book

Data Mining: Practical Machine Learning Tools and Techniques

TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
Journal ArticleDOI

An introduction to ROC analysis

TL;DR: The purpose of this article is to serve as an introduction to ROC graphs and as a guide for using them in research.
Related Papers (5)