Home
/
Authors
/
Jan Kleindienst

Author

Jan Kleindienst

Other affiliations: Nuance Communications

Bio: Jan Kleindienst is an academic researcher from IBM. The author has contributed to research in topics: Dialog system & Dialog box. The author has an hindex of 26, co-authored 96 publications receiving 3507 citations. Previous affiliations of Jan Kleindienst include Nuance Communications.

Papers published on a yearly basis

2023
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2005
2004
2003
2002
2001
2000
1999
1998

Papers

PDF

Open Access

More filters

Patent•

Systems and methods for implementing modular DOM (Document Object Model)-based multi-modal browsers

[...]

David Boloker, Rafah A. Hosn, Photina Jaeyun Jang, Jan Kleindienst, Tomas Macek, Stephane H. Maes, T. V. Raman, Ladislav Seredi - Show less +4 more

04 Dec 2001

TL;DR: In this article, the authors present a framework for building modular multi-modal browsers using a DOM (Document Object Model) and MVC (Model-View-Controller) framework that enables a user to interact in parallel with the same information via a multiplicity of channels, devices, and/or user interfaces.

...read moreread less

Abstract: Systems and methods for building multi-modal browsers applications and, in particular, to systems and methods for building modular multi-modal browsers using a DOM (Document Object Model) and MVC (Model-View-Controller) framework that enables a user to interact in parallel with the same information via a multiplicity of channels, devices, and/or user interfaces, while presenting a unified, synchronized view of such information across the various channels, devices and/or user interfaces supported by the multi-modal browser. The use of a DOM framework (or specifications similar to DOM) allows existing browsers to be extended without modification of the underling browser code. A multi-modal browser framework is modular and flexible to allow various fat client and thin (distributed) client approaches.

...read moreread less

342 citations

Proceedings Article•DOI•

Text Understanding with the Attention Sum Reader Network

[...]

Rudolf Kadlec¹, Martin Schmid², Ondrej Bajgar¹, Jan Kleindienst¹•Institutions (2)

IBM¹, Charles University in Prague²

04 Mar 2016

TL;DR: This paper presented a new model that uses attention to directly pick the answer from the context as opposed to computing the answer using a blended representation of words in the document as is usual in similar models.

...read moreread less

Abstract: Several large cloze-style context-question-answer datasets have been introduced recently: the CNN and Daily Mail news data and the Children's Book Test. Thanks to the size of these datasets, the associated text comprehension task is well suited for deep-learning techniques that currently seem to outperform all alternative approaches. We present a new, simple model that uses attention to directly pick the answer from the context as opposed to computing the answer using a blended representation of words in the document as is usual in similar models. This makes the model particularly suitable for question-answering problems where the answer is a single word from the document. Ensemble of our models sets new state of the art on all evaluated datasets.

...read moreread less

272 citations

Posted Content•

Text Understanding with the Attention Sum Reader Network

[...]

Rudolf Kadlec¹, Martin Schmid², Ondrej Bajgar¹, Jan Kleindienst¹•Institutions (2)

IBM¹, Charles University in Prague²

04 Mar 2016-arXiv: Computation and Language

TL;DR: This article presented a new model that uses attention to directly pick the answer from the context as opposed to computing the answer using a blended representation of words in the document as is usual in similar models.

...read moreread less

235 citations

Patent•

Systems and methods for providing conversational computing via javaserver pages and javabeans

[...]

Jaroslav Gergic, Jan Kleindienst, Stephane H. Maes, T. V. Raman, Jan Sedivy - Show less +1 more

18 Apr 2001

TL;DR: In this paper, a conversational Markup Language (CML) is proposed for representing dialogues or conversations the user will have with any given computing device, where interaction may comprise, but is not limited, visual based (text and graphical) user interaction and speech based user interaction.

...read moreread less

Abstract: A new application programming language is provided which is based on user interaction with any device which a user is employing to access any type of information. The new language is referred to herein as a “Conversational Markup Language (CML). In a preferred embodiment, CML is a high level XML based language for representing “dialogs” or “conversations” the user will have with any given computing device. For example, interaction may comprise, but is not limited to, visual based (text and graphical) user interaction and speech based user interaction. Such a language allows application authors to program applications using interaction-based elements referred to herein as “conversational gestures.” The present invention also provides for various embodiments of a multimodal browser capable of supporting the features of CML in accordance with various modality specific representations, e.g., HTML based graphical user interface (GUI) browser, VoiceXML based speech browser, etc.

...read moreread less

212 citations

Patent•

Command boundary identifier for conversational natural language

[...]

Ganesh N. Ramaswamy¹, Jan Kleindienst²•Institutions (2)

IBM¹, Nuance Communications²

28 Oct 1998

TL;DR: In this paper, an apparatus for automatically identifying command boundaries in a conversational natural language system, in accordance with the present invention, includes a speech recognizer for converting an input signal to recognized text and a boundary identifier coupled to the speech-recognizer for receiving the recognized text, the boundary identifier outputting the command if present in recognized text.

...read moreread less

Abstract: An apparatus for automatically identifying command boundaries in a conversational natural language system, in accordance with the present invention, includes a speech recognizer for converting an input signal to recognized text and a boundary identifier coupled to the speech recognizer for receiving the recognized text and determining if a command is present in the recognized text, the boundary identifier outputting the command if present in the recognized text. A method for identifying command boundaries in a conversational natural language system is also included.

...read moreread less

187 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20

Collapse

Cited by

PDF

Open Access

More filters

Proceedings Article•

Bidirectional Attention Flow for Machine Comprehension

[...]

Minjoon Seo¹, Aniruddha Kembhavi², Ali Farhadi¹, Hannaneh Hajishirzi¹•Institutions (2)

University of Washington¹, Allen Institute for Artificial Intelligence²

04 Nov 2016

TL;DR: The BIDAF network is introduced, a multi-stage hierarchical process that represents the context at different levels of granularity and uses bi-directional attention flow mechanism to obtain a query-aware context representation without early summarization.

...read moreread less

Abstract: Machine comprehension (MC), answering a query about a given context paragraph, requires modeling complex interactions between the context and the query. Recently, attention mechanisms have been successfully extended to MC. Typically these methods use attention to focus on a small portion of the context and summarize it with a fixed-size vector, couple attentions temporally, and/or often form a uni-directional attention. In this paper we introduce the Bi-Directional Attention Flow (BIDAF) network, a multi-stage hierarchical process that represents the context at different levels of granularity and uses bi-directional attention flow mechanism to obtain a query-aware context representation without early summarization. Our experimental evaluations show that our model achieves the state-of-the-art results in Stanford Question Answering Dataset (SQuAD) and CNN/DailyMail cloze test.

...read moreread less

1,718 citations

Patent•

Intelligent Automated Assistant

[...]

Thomas R. Gruber¹, Adam Cheyer¹, Dag Kittlaus¹, Didier Rene Guzzoni¹, Christopher Dean Brigham¹, Richard Donald Giuli¹, Marcello Bastea-Forte¹, Harry J. Saddler¹ - Show less +4 more•Institutions (1)

Apple Inc.¹

11 Jan 2011

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.

...read moreread less

Abstract: An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionally powered by external services with which the system can interact.

...read moreread less

1,462 citations

Proceedings Article•

MS MARCO: A Human Generated MAchine Reading COmprehension Dataset.

[...]

Tri Nguyen¹, Mir Rosenberg, Xia Song², Jianfeng Gao², Saurabh Tiwary², Rangan Majumder, Li Deng² - Show less +3 more•Institutions (2)

California Institute of Technology¹, Microsoft²

04 Nov 2016

TL;DR: MS MARCO as mentioned in this paper is a large scale dataset for reading comprehension and question answering, where all questions are sampled from real anonymized user queries and context passages from which answers in the dataset are derived from real web documents using the most advanced version of the Bing search engine.

...read moreread less

Abstract: This paper presents our recent work on the design and development of a new, large scale dataset, which we name MS MARCO, for MAchine Reading COmprehension. This new dataset is aimed to overcome a number of well-known weaknesses of previous publicly available datasets for the same task of reading comprehension and question answering. In MS MARCO, all questions are sampled from real anonymized user queries. The context passages, from which answers in the dataset are derived, are extracted from real web documents using the most advanced version of the Bing search engine. The answers to the queries are human generated. Finally, a subset of these queries has multiple answers. We aim to release one million queries and the corresponding answers in the dataset, which, to the best of our knowledge, is the most comprehensive real-world dataset of its kind in both quantity and quality. We are currently releasing 100,000 queries with their corresponding answers to inspire work in reading comprehension and question answering along with gathering feedback from the research community.

...read moreread less

1,271 citations

Patent•

Method and system for enabling connectivity to a data system

[...]

David George, Joseph Harb, Chris Haven, Dennis Ferry, Wen-Hsin Lee, Jaya Srinivasan - Show less +2 more

13 Feb 2003

TL;DR: In this article, an XSLT style sheet is automatically generated to filter out data pertaining to UI objects that were not voice or pass-through enabled, such as screens, views, applets, columns and fields.

...read moreread less

Abstract: A method and system that provides filtered data from a data system (16). In one embodiment that system includes an API (application programming interface) and associated software modules to enable third party applications to access an enterprise data system. Administrators are enabled to select specific user interface (UI) objects (72), such as screens, views, applets, columns and fields to voice or pass-through enable via a GUI (108) that presents a tree depicting a hierarchy of the UI objects (72) within a user interface of an application (14). An XSLT style sheet is then automatically generated to filter out data pertaining to UI objects (72) that were not voice or pass-through enabled. In response to a request for data, unfiltered data are retrieved from the data system and a specified style sheet is applied to the unfiltered data to return filtered data pertaining to only those fields and columns that are voice or pass-through enabled.

...read moreread less

1,226 citations

Patent•DOI•

Systems and methods for responding to natural language speech utterance

[...]

Robert A. Kennewick, David Locke, Michael R. Kennewick, Richard Kennewick, Tom Freeman - Show less +1 more

12 Feb 2010-Journal of the Acoustical Society of America

TL;DR: In this paper, a system for receiving speech and non-speech communications of natural language questions and commands, transcribing the speech and NN communications to textual messages, and executing the questions and/or commands is presented.

...read moreread less

Abstract: Systems and methods are provided for receiving speech and non-speech communications of natural language questions and/or commands, transcribing the speech and non-speech communications to textual messages, and executing the questions and/or commands. The invention applies context, prior information, domain knowledge, and user specific profile data to achieve a natural environment for one or more users presenting questions or commands across multiple domains. The systems and methods creates, stores and uses extensive personal profile information for each user, thereby improving the reliability of determining the context of the speech and non-speech communications and presenting the expected results for a particular question or command.

...read moreread less

1,164 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse