scispace - formally typeset
R

Robin Genuer

Researcher at French Institute of Health and Medical Research

Publications -  34
Citations -  3014

Robin Genuer is an academic researcher from French Institute of Health and Medical Research. The author has contributed to research in topics: Random forest & Feature selection. The author has an hindex of 13, co-authored 32 publications receiving 2262 citations. Previous affiliations of Robin Genuer include Département de Mathématiques & University of Paris-Sud.

Papers
More filters
Journal ArticleDOI

Variable selection using random forests

TL;DR: This paper proposes, focusing on random forests, the increasingly used statistical method for classification and regression problems introduced by Leo Breiman in 2001, to investigate two classical issues of variable selection, and proposes a strategy involving a ranking of explanatory variables using the random forests score of importance and a stepwise ascending variable introduction strategy.
Journal ArticleDOI

VSURF: An R Package for Variable Selection Using Random Forests

TL;DR: The VSURF algorithm returns two subsets of variables, one of which is a subset of important variables including some redundancy which can be relevant for interpretation, and the second one corresponding to a model trying to avoid redundancy focusing more closely on prediction objective.
Journal ArticleDOI

Random Forests for Big Data

TL;DR: In this article, a selective review of available proposals that deal with scaling random forests to Big Data problems is presented, which rely on parallel environments or on online adaptations of random forests.
Posted Content

Random Forests: some methodological insights

TL;DR: This paper aims at confirming, known but sparse, advice for using random forests and at proposing some complementary remarks for both standard problems as well as high dimensional ones for which the number of variables hugely exceeds the sample size.
Journal ArticleDOI

Variance reduction in purely random forests

TL;DR: A general upper bound is shown which emphasises the fact that a forest reduces the variance and it is proved that compared with random trees, RFs improve accuracy by reducing the estimator variance by a factor of three-fourths.