Gradient boosting for high-dimensional prediction of rare events

doi:10.1016/J.CSDA.2016.07.016

Journal ArticleDOI

Gradient boosting for high-dimensional prediction of rare events

Rok Blagus, +1 more

- 01 Sep 2017 -

Computational Statistics & Data Analysis

- Vol. 113, pp 19-37

Chats0

TLDR

It is demonstrated that the proposed corrections successfully remove the rare events bias and outperform the other ensemble classifiers that were considered and large flexibility and high interpretability of the proposed methods is also illustrated.

About:

This article is published in Computational Statistics & Data Analysis.The article was published on 2017-09-01. It has received 46 citations till now. The article focuses on the topics: Gradient boosting & Rare events.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Learning from class-imbalanced data

Guo Haixiang, +5 more

- 01 May 2017 -

Expert Systems With Applications

TL;DR: An in depth review of rare event detection from an imbalanced learning perspective and a comprehensive taxonomy of the existing application domains of im balanced learning are provided.

...read moreread less

Journal ArticleDOI

A Local Adaptive Minority Selection and Oversampling Method for Class-Imbalanced Fault Diagnostics in Industrial Systems

Zhenyu Wu, +5 more

- 01 Dec 2020 -

IEEE Transactions on Reliability

TL;DR: The developed method uses a local-weighted minority oversampling strategy to identify hard-to-learn informative minority fault samples and an EM-based imputation algorithm to generate fault samples based on the distribution of minority samples.

...read moreread less

Journal ArticleDOI

Machine learning for energy performance prediction at the design stage of buildings

Razak Olu-Ajayi, +4 more

- 01 Feb 2022 -

Energy for Sustainable Development

TL;DR: It is shown that it is possible to develop a high performing ML model for building energy use prediction at the design stage and Gradient Boosting (GB) outperformed the other models with an accuracy of 0.67 for predicting building energy performance.

...read moreread less

Proceedings ArticleDOI

Improving Imbalanced Dataset Classification Using Oversampling and Gradient Boosting

Nur Heri Cahyana, +2 more

TL;DR: Experiments showed that oversampling technic increase accuracy from 2% to 11% for the dataset Mammography, Liver Disorders, Diabetes (Pima Indian), Indian Liver, Habberman, and Immunotherapy, and Borderline-SMOTE increases higher accuracy compared to other oversampled method.

...read moreread less

Journal ArticleDOI

LRID: A new metric of multi-class imbalance degree based on likelihood-ratio test

Rui Zhu, +5 more

- 11 Sep 2018 -

Pattern Recognition Letters

TL;DR: A new metric based on the likelihood-ratio test, LRID, is proposed to provide a more reliable measurement of class-imbalance extent for multi-class data.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Random Forests

Leo Breiman

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.

...read moreread less

Journal ArticleDOI

Support-Vector Networks

Corinna Cortes, +1 more

- 15 Sep 1995 -

Machine Learning

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

...read moreread less

Journal ArticleDOI

Classification and Regression Trees.

John Van Ryzin, +4 more

- 01 Mar 1986 -

Journal of the American Statistical Asso...

Journal ArticleDOI

A Simple Sequentially Rejective Multiple Test Procedure

Sture Holm

- 01 Jan 1979 -

Scandinavian Journal of Statistics

TL;DR: In this paper, a simple and widely accepted multiple test procedure of the sequentially rejective type is presented, i.e. hypotheses are rejected one at a time until no further rejections can be done.

...read moreread less

Book

The Elements of Statistical Learning: Data Mining, Inference, and Prediction

Trevor Hastie, +2 more

TL;DR: In this paper, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.

...read moreread less