Building Watson: An Overview of the DeepQA Project

doi:10.1609/AIMAG.V31I3.2303

Journal Article

David A. Ferrucci, +11 more

- 28 Jul 2010 -

AI Magazine

- Vol. 31, Iss: 3, pp 59-79

TLDR

The results strongly suggest that DeepQA is an effective and extensible architecture that may be used as a foundation for combining, deploying, evaluating and advancing a wide range of algorithmic techniques to rapidly advance the field of QA.

Abstract:

IBM Research undertook a challenge to build a computer system that could compete at the human champion level in real time on the American TV quiz show Jeopardy! The extent of the challenge includes fielding a real-time automatic contestant on the show, not merely a laboratory exercise. The Jeopardy! Challenge helped us address requirements that led to the design of the DeepQA architecture and the implementation of Watson. After 3 years of intense research and development by a core team of about 20 researchers, Watson is performing at human expert levels in terms of precision, confidence, and speed at the Jeopardy! quiz show. Our results strongly suggest that DeepQA is an effective and extensible architecture that may be used as a foundation for combining, deploying, evaluating, and advancing a wide range of algorithmic techniques to rapidly advance the field of QA.

Citations

Posted Content

SQuAD: 100,000+ Questions for Machine Comprehension of Text

Pranav Rajpurkar, +3 more

- 16 Jun 2016 -

arXiv: Computation and Language

TL;DR: The Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage.

Journal Article

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

Ranjay Krishna, +11 more

- 01 May 2017 -

International Journal of Computer Vision

TL;DR: The Visual Genome dataset contains over 108k images, where each image has an average of 35 objects, 26 attributes, and 21 pairwise relationships between objects.

Proceedings Article

SQuAD: 100,000+ Questions for Machine Comprehension of Text

Pranav Rajpurkar, +3 more

TL;DR: The Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage.

Journal Article

DBpedia - A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia

Jens Lehmann, +11 more

- 01 Jan 2015 -

Semantic Web

TL;DR: An overview of the DBpedia community project is given, including its architecture, technical implementation, maintenance, internationalisation, usage statistics and applications. DBpedia is one of the central interlinking hubs in the Linked Open Data (LOD) cloud.

Journal Article

Wikidata: a free collaborative knowledgebase

Denny Vrandecic, +1 more

- 23 Sep 2014 -

Communications of the ACM

TL;DR: This collaboratively edited knowledgebase provides a common source of data for Wikipedia, and everyone else, to help improve the quality of the encyclopedia.

References

Journal Article

WordNet: a lexical database for English

George A. Miller

- 01 Nov 1995 -

Communications of the ACM

TL;DR: WordNet provides a more effective combination of traditional lexicographic information and modern computing, and is an online lexical database designed for use under program control.

Journal Article

Identification of common molecular subsequences.

Temple F. Smith, +1 more

- 25 Mar 1981 -

Journal of Molecular Biology

TL;DR: This letter extends the heuristic homology algorithm of Needleman & Wunsch (1970) to find a pair of segments, one from each of two long sequences, such that there is no other pair of segments with greater similarity (homology).

Journal Article

Original Contribution: Stacked generalization

David H. Wolpert

- 05 Feb 1992 -

Neural Networks

TL;DR: The conclusion is that for almost any real-world generalization problem one should use some version of stacked generalization to minimize the generalization error rate.

Proceedings Article

Optimizing search engines using clickthrough data

Thorsten Joachims

TL;DR: The goal of this paper is to develop a method that utilizes clickthrough data for training, namely the query-log of the search engine in connection with the log of links the users clicked on in the presented ranking.

Journal Article

Adaptive mixtures of local experts

Robert A. Jacobs, +3 more

- 01 Mar 1991 -

Neural Computation

TL;DR: A new supervised learning procedure is presented for systems composed of many separate networks, each of which learns to handle a subset of the complete set of training cases. The procedure is demonstrated to divide a vowel discrimination task into appropriate subtasks, each of which can be solved by a very simple expert network.

Related Papers (5)

WordNet: a lexical database for English

DBpedia: a nucleus for a web of open data

Artificial Intelligence: A Modern Approach

Glove: Global Vectors for Word Representation

Computing Machinery and Intelligence