Y
Yishay Mansour
Researcher at Tel Aviv University
Publications - 546
Citations - 30407
Yishay Mansour is an academic researcher from Tel Aviv University. The author has contributed to research in topics: Regret & Upper and lower bounds. The author has an hindex of 80, co-authored 511 publications receiving 26984 citations. Previous affiliations of Yishay Mansour include Technion – Israel Institute of Technology & IBM.
Papers
More filters
Book ChapterDOI
Convergence time to Nash equilibria
TL;DR: The number of steps required to reach a pure Nash Equilibrium in a load balancing scenario where each job behaves selfishly and attempts to migrate to a machine which will minimize its cost is studied.
Proceedings Article
Approximate Planning in Large POMDPs via Reusable Trajectories
TL;DR: Upper bounds on the sample complexity are proved showing that, even for infinitely large and arbitrarily complex POMDPs, the amount of data needed can be finite, and depends only linearly on the complexity of the restricted strategy class II, and exponentially on the horizon time.
Proceedings ArticleDOI
An experimental and theoretical comparison of model selection methods
TL;DR: A detailed comparison of three well-known model selection methods for finding a balance between the complexity of the hypothesis chosen and its observed error on a random training sample of limited size, when the goal is that of minimizing the resulting generalization error.
Book ChapterDOI
Learning, Regret minimization, and Equilibria
Avrim Blum,Yishay Mansour +1 more
TL;DR: This chapter presents algorithms for repeated play of a matrix game with the guarantee that against any opponent, they will perform nearly as well as the best fixed action in hindsight, and presents a general reduction showing how to convert any algorithm for minimizing external regret to one that minimizes this stronger form of regret as well.
Journal ArticleDOI
Implementing the "wisdom of the crowd"
TL;DR: In this article, the authors study a mechanism design model in which agents each arrive sequentially and choose one action from a set of actions with unknown rewards, and characterize the optimal disclosure policy of a planner whose goal is to maximize social welfare.