Top 2 papers published by Konrad Scheffler from Illumina in 2002

Proceedings Article•

Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning

[...]

Konrad Scheffler¹, Steve Young¹•Institutions (1)

24 Mar 2002

TL;DR: Q-learning with eligibility traces was applied to obtain policies for a telephone-based cinema information system, and the policies outperformed handcrafted policies that operated in the same restricted state space, and gave performance similar to the original design that had been through several iterations of manual refinement.

...read moreread less

Abstract: This paper describes a method for automatic design of human-computer dialogue strategies by means of reinforcement learning, using a dialogue simulation tool to model the user behaviour and system recognition performance. To the authors' knowledge this is the first application of a detailed simulation tool to this problem. The simulation tool is trained on a corpus of real user data. Compared to direct state transition modelling, it has the major advantage that different state space representations can be studied without collecting more training data. We applied Q-learning with eligibility traces to obtain policies for a telephone-based cinema information system, comparing the effect of different state space representations and evaluation functions. The policies outperformed handcrafted policies that operated in the same restricted state space, and gave performance similar to the original design that had been through several iterations of manual refinement.

...read moreread less

153 citations

Proceedings Article•DOI•

Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning

[...]

Konrad Scheffler¹, Steve Young¹•Institutions (1)

University of Cambridge¹

01 Jan 2002

TL;DR: In this article, a method for automatic design of human-computer dialogue strategies by means of reinforcement learning, using a dialogue simulation tool to model the user behaviour and system recognition performance, is described.

...read moreread less

14 citations

Showing papers by "Konrad Scheffler published in 2002"