scispace - formally typeset
A

Aurélien Garivier

Researcher at École normale supérieure de Lyon

Publications -  114
Citations -  5105

Aurélien Garivier is an academic researcher from École normale supérieure de Lyon. The author has contributed to research in topics: Regret & Upper and lower bounds. The author has an hindex of 25, co-authored 103 publications receiving 4364 citations. Previous affiliations of Aurélien Garivier include Paul Sabatier University & Institut de Mathématiques de Toulouse.

Papers
More filters
Journal Article

On the complexity of best-arm identification in multi-armed bandit models

TL;DR: This work introduces generic notions of complexity for the two dominant frameworks considered in the literature: fixed-budget and fixed-confidence settings, and provides the first known distribution-dependent lower bound on the complexity that involves information-theoretic quantities and holds when m ≥ 1 under general assumptions.
Book ChapterDOI

On upper-confidence bound policies for switching bandit problems

TL;DR: An upperbound for the expected regret is established by upper-bounding the expectation of the number of times suboptimal arms are played and it is shown that the discounted UCB and the sliding-window UCB both match the lower-bound up to a logarithmic factor.
Proceedings Article

Parametric Bandits: The Generalized Linear Case

TL;DR: The analysis highlights a key difficulty in generalizing linear bandit algorithms to the non-linear case, which is solved in GLM-UCB by focusing on the reward space rather than on the parameter space, and provides a tuning method based on asymptotic arguments, which leads to significantly better practical performance.
Proceedings Article

On Bayesian Upper Confidence Bounds for Bandit Problems

TL;DR: It is proved that the corresponding algorithm, termed BayesUCB, satisfies finite-time regret bounds that imply its asymptotic optimality and gives a general formulation for a class of Bayesian index policies that rely on quantiles of the posterior distribution.
Journal ArticleDOI

Kullback-Leibler upper confidence bounds for optimal sequential allocation

TL;DR: The main contribution is a unified finite-time analysis of the regret of these algorithms that asymptotically matches the lower bounds of Lai and Robbins (1985) and Burnetas and Katehakis (1996), respectively.