scispace - formally typeset
B

Binyamin Manela

Researcher at Ben-Gurion University of the Negev

Publications -  6
Citations -  36

Binyamin Manela is an academic researcher from Ben-Gurion University of the Negev. The author has contributed to research in topics: Hindsight bias & Process (engineering). The author has an hindex of 2, co-authored 6 publications receiving 14 citations.

Papers
More filters
Posted Content

Bias-Reduced Hindsight Experience Replay with Virtual Goal Prioritization

TL;DR: In this paper, the authors proposed to prioritize virtual goals from which the agent will learn more valuable information and reduce existing bias in HER by the removal of misleading samples, which showed vast improvement in the final success rate and sample efficiency when compared to the original HER algorithm.
Posted Content

Curriculum Learning with Hindsight Experience Replay for Sequential Object Manipulation Tasks

TL;DR: This study presents a new algorithm that combines curriculum learning with Hindsight Experience Replay (HER), to learn sequential object manipulation tasks for multiple goals and sparse feedback.
Journal ArticleDOI

Curriculum learning with Hindsight Experience Replay for sequential object manipulation tasks.

TL;DR: In this paper, the authors combine curriculum learning with hindsight experience replay (HER) to learn sequential object manipulation tasks for multiple goals and sparse feedback, where the curriculum can be used to decompose a complex task into a sequence of source tasks with increasing complexity.
Journal ArticleDOI

Bias-reduced hindsight experience replay with virtual goal prioritization

TL;DR: This paper presents two improvements over the existing HER algorithm, which prioritize virtual goals from which the agent will learn more valuable information, and reduces existing bias in HER by the removal of misleading samples.
Posted Content

Deep Reinforcement Learning for Complex Manipulation Tasks with Sparse Feedback

Binyamin Manela
- 12 Jan 2020 - 
TL;DR: This thesis presents three algorithms based on the existing HER algorithm that improves its performances and enables the learning of complex, sequential, tasks using a form of curriculum learning combined with HER.