scispace - formally typeset
Journal ArticleDOI

Automatic knowledge extraction from documents

Reads0
Chats0
TLDR
This paper describes in detail what kind of shallow knowledge is extracted, how it is automatically done from a large corpus, and how additional semantics are inferred from aggregate statistics of the automatically extracted shallow knowledge.
Abstract
Access to a large amount of knowledge is critical for success at answering open-domain questions for DeepQA systems such as IBM Watson™. Formal representation of knowledge has the advantage of being easy to reason with, but acquisition of structured knowledge in open domains from unstructured data is often difficult and expensive. Our central hypothesis is that shallow syntactic knowledge and its implied semantics can be easily acquired and can be used in many areas of a question-answering system. We take a two-stage approach to extract the syntactic knowledge and implied semantics. First, shallow knowledge from large collections of documents is automatically extracted. Second, additional semantics are inferred from aggregate statistics of the automatically extracted shallow knowledge. In this paper, we describe in detail what kind of shallow knowledge is extracted, how it is automatically done from a large corpus, and how additional semantics are inferred from aggregate statistics. We also briefly discuss the various ways extracted knowledge is used throughout the IBM DeepQA system.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Introduction to This is Watson

TL;DR: A brief history of the events and ideas that positioned the team to take on the Jeopardy! challenge, build Watson, IBM Watson™, and ultimately triumph is provided, and how the system performed at champion levels is summarized.
Journal ArticleDOI

Argumentation Mining: State of the Art and Emerging Trends

TL;DR: This survey article introduces argumentation models and methods, reviews existing systems and applications, and discusses challenges and perspectives of this exciting new research area.
Journal ArticleDOI

Effects of big data analytics and traditional marketing analytics on new product success: A knowledge fusion perspective

TL;DR: The study suggests that knowledge fusion to improve NPS is not automatic and requires strategic choices to obtain its benefits.
Journal ArticleDOI

Deep parsing in Watson

TL;DR: Two deep parsing components, an English Slot Grammar (ESG) parser and a predicate-argument structure (PAS) builder, are described and illustrated how they are used in a pattern-based relation extraction component of Watson.
Journal ArticleDOI

Blockchain-Powered Parallel Healthcare Systems Based on the ACP Approach

TL;DR: The emerging blockchain technology with PHS is combined, via constructing a consortium blockchain linking patients, hospitals, health bureaus, and healthcare communities for comprehensive healthcare data sharing, medical records review, and care auditability.
References
More filters
Journal ArticleDOI

WordNet : an electronic lexical database

Christiane Fellbaum
- 01 Sep 2000 - 
TL;DR: The lexical database: nouns in WordNet, Katherine J. Miller a semantic network of English verbs, and applications of WordNet: building semantic concordances are presented.
Proceedings ArticleDOI

Yago: a core of semantic knowledge

TL;DR: YAGO as discussed by the authors is a light-weight and extensible ontology with high coverage and quality, which includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASONEPRIZE).
Book

English Verb Classes and Alternations: A Preliminary Investigation

Beth Levin
TL;DR: Levin this paper classified over 3,000 English verbs according to shared meaning and behavior, and examined verb behavior with respect to a wide range of syntactic alternations that reflect verb meaning.
Proceedings ArticleDOI

The Berkeley FrameNet Project

TL;DR: This report will present the project's goals and workflow, and information about the computational tools that have been adapted or created in-house for this work.
Journal ArticleDOI

DBpedia - A crystallization point for the Web of Data

TL;DR: The extraction of the DBpedia knowledge base is described, the current status of interlinking DBpedia with other data sources on the Web is discussed, and an overview of applications that facilitate the Web of Data around DBpedia is given.
Related Papers (5)