scispace - formally typeset
M

Marcin Szubert

Researcher at Poznań University of Technology

Publications -  22
Citations -  255

Marcin Szubert is an academic researcher from Poznań University of Technology. The author has contributed to research in topics: Temporal difference learning & Reinforcement learning. The author has an hindex of 10, co-authored 21 publications receiving 242 citations. Previous affiliations of Marcin Szubert include University of Vermont.

Papers
More filters
Proceedings ArticleDOI

Temporal difference learning of N-tuple networks for the game 2048

TL;DR: The conducted experiments demonstrate that the learning algorithm using afterstate value functions is able to consistently produce players winning over 97% of games, and show that n-tuple networks combined with an appropriate learning algorithm have large potential, which could be exploited in other board games.
Journal ArticleDOI

On Scalability, Generalization, and Hybridization of Coevolutionary Learning: A Case Study for Othello

TL;DR: It is demonstrated that although evolutionary-based methods yield players that fare best against a fixed heuristic player, it is the coev evolutionary temporal difference learning (CTDL), a hybrid of coevolution and TDL, that generalizes better and proves superior when confronted with a pool of previously unseen opponents.
Proceedings ArticleDOI

Learning n-tuple networks for othello by coevolutionary gradient search

TL;DR: This work proposes Coevolutionary Gradient Search, a blueprint for a family of iterative learning algorithms that combine elements of local search and population-based search, and concludes that the hybridization of search techniques improves the convergence.
Proceedings ArticleDOI

Coevolutionary Temporal Difference Learning for Othello

TL;DR: Coevolutionary Temporal Difference Learning (CTDL) as mentioned in this paper is a hybrid approach combining co-evolution and reinforcement learning that works by interlacing one-population competitive coevolution with temporal difference learning.
Proceedings ArticleDOI

Improving coevolution by random sampling

TL;DR: It is shown that if coevolution uses two-population setup and engages also random opponents, it is capable of producing equally good strategies as evolution with random sampling for the expected utility performance measure.