Learning from class-imbalanced data

doi:10.1016/J.ESWA.2016.12.035

Journal Article•DOI•

Learning from class-imbalanced data

Guo Haixiang¹, Li Yijing¹, Jennifer Shang², Gu Mingyun¹, Huang Yuanyue¹, Gong Bing³ - Show less +2 more•Institutions (3)

China University of Geosciences (Wuhan)¹, University of Pittsburgh², Technical University of Madrid³

01 May 2017-Expert Systems With Applications (Pergamon)-Vol. 73, Iss: 73, pp 220-239

TL;DR: An in depth review of rare event detection from an imbalanced learning perspective and a comprehensive taxonomy of the existing application domains of im balanced learning are provided.

read less

Abstract: 527 articles related to imbalanced data and rare events are reviewed.Viewing reviewed papers from both technical and practical perspectives.Summarizing existing methods and corresponding statistics by a new taxonomy idea.Categorizing 162 application papers into 13 domains and giving introduction.Some opening questions are discussed at the end of this manuscript. Rare events, especially those that could potentially negatively impact society, often require humans decision-making responses. Detecting rare events can be viewed as a prediction task in data mining and machine learning communities. As these events are rarely observed in daily life, the prediction task suffers from a lack of balanced data. In this paper, we provide an in depth review of rare event detection from an imbalanced learning perspective. Five hundred and seventeen related papers that have been published in the past decade were collected for the study. The initial statistics suggested that rare events detection and imbalanced learning are concerned across a wide range of research areas from management science to engineering. We reviewed all collected papers from both a technical and a practical point of view. Modeling methods discussed include techniques such as data preprocessing, classification algorithms and model evaluation. For applications, we first provide a comprehensive taxonomy of the existing application domains of imbalanced learning, and then we detail the applications for each category. Finally, some suggestions from the reviewed papers are incorporated with our experiences and judgments to offer further research directions for the imbalanced learning and rare event detection fields.

...read moreread less

Learning from class-imbalanced data

Citations

Cites background or methods or result from "Learning from class-imbalanced data..."

Cites background from "Learning from class-imbalanced data..."

References

"Learning from class-imbalanced data..." refers methods in this paper

Related Papers (5)