scispace - formally typeset
M

Mahdi Milani Fard

Researcher at Google

Publications -  26
Citations -  364

Mahdi Milani Fard is an academic researcher from Google. The author has contributed to research in topics: Reinforcement learning & Metric (mathematics). The author has an hindex of 10, co-authored 24 publications receiving 302 citations. Previous affiliations of Mahdi Milani Fard include McGill University & University of Tehran.

Papers
More filters
Proceedings Article

Fast and Flexible Monotonic Functions with Ensembles of Lattices

TL;DR: This work learns ensembles of monotonic calibrated interpolated look-up tables that produce similar or better accuracy, while providing guaranteed monotonicity consistent with prior knowledge, smaller model size and faster evaluation.
Proceedings Article

Modelling Sparse Dynamical Systems with Compressed Predictive State Representations

TL;DR: A new algorithm, called Compressed Predictive State Representation (CPSR), is presented, for learning models of high-dimensional partially observable uncontrolled dynamical systems from small sample sets, that exploits a particular sparse structure present in many domains.
Journal ArticleDOI

Efficient learning and planning with compressed predictive states

TL;DR: In this article, the authors introduce the notion of compressed predictive state representations (CPSRs), which combine dimensionality reduction, incremental matrix decomposition, and compressed sensing to learn accurate approximations of PSRs, drastically reducing the computational costs associated with learning.
Proceedings Article

PAC-Bayesian Model Selection for Reinforcement Learning

TL;DR: The first set of PAC-Bayesian bounds for the batch reinforcement learning problem in finite state spaces are introduced and it is demonstrated how such bounds can be used for model-selection in control problems where prior information is available either on the dynamics of the environment, or on the value of actions.
Proceedings Article

Compressed least-squares regression on sparse spaces

TL;DR: This paper develops the bias-variance analysis of a least-squares regression estimator in compressed spaces when random projections are applied on sparse input signals and shows how the choice of the projection size affects the performance of regression on compressed spaces.