scispace - formally typeset
M

Maria Carolina Monard

Researcher at University of São Paulo

Publications -  117
Citations -  6327

Maria Carolina Monard is an academic researcher from University of São Paulo. The author has contributed to research in topics: Feature selection & Fuzzy classification. The author has an hindex of 22, co-authored 117 publications receiving 5173 citations. Previous affiliations of Maria Carolina Monard include Spanish National Research Council & Universidade Federal do Rio Grande do Sul.

Papers
More filters
Journal ArticleDOI

A study of the behavior of several methods for balancing machine learning training data

TL;DR: This work performs a broad experimental evaluation involving ten methods, three of them proposed by the authors, to deal with the class imbalance problem in thirteen UCI data sets, and shows that, in general, over-sampling methods provide more accurate results than under-sampled methods considering the area under the ROC curve (AUC).
Journal ArticleDOI

An analysis of four missing data treatment methods for supervised learning

TL;DR: This analysis indicates that missing data imputation based on the k-nearest neighbor algorithm can outperform the internal methods used by C4.5 and CN2 to treat missing data, and can also outperforms the mean or mode imputation method, which is a method broadly used to treatMissing values.
Book ChapterDOI

Class Imbalances versus Class Overlapping: An Analysis of a Learning System Behavior

TL;DR: This work develops a systematic study aiming to question whether class imbalances are truly to blame for the loss of performance of learning systems or whether the class imbalance are not a problem by themselves.

A Study of K-Nearest Neighbour as an Imputation Method.

TL;DR: This analysis indicates that missing data imputation based on the k-nearest neighbour algorithm can outperform the internal methods used by C4.5 and CN2 to treat missing data.

Balancing Training Data for Automated Annotation of Keywords: a Case Study.

TL;DR: The experiments show that the classifiers induced from balanced data sampled with the present work are more accurate than those induced from the original data.