Automatic knowledge extraction from documents

doi:10.1147/JRD.2012.2186519

Journal Article

James Fan, +3 more

- 01 May 2012 -

IBM Journal of Research and Development

- Vol. 56, Iss: 3, pp 290-299

TLDR

This paper describes in detail what kind of shallow knowledge is extracted, how it is automatically done from a large corpus, and how additional semantics are inferred from aggregate statistics of the automatically extracted shallow knowledge.

Abstract:

Access to a large amount of knowledge is critical for success at answering open-domain questions for DeepQA systems such as IBM Watson™. Formal representation of knowledge has the advantage of being easy to reason with, but acquisition of structured knowledge in open domains from unstructured data is often difficult and expensive. Our central hypothesis is that shallow syntactic knowledge and its implied semantics can be easily acquired and can be used in many areas of a question-answering system. We take a two-stage approach to extract the syntactic knowledge and implied semantics. First, shallow knowledge from large collections of documents is automatically extracted. Second, additional semantics are inferred from aggregate statistics of the automatically extracted shallow knowledge. In this paper, we describe in detail what kind of shallow knowledge is extracted, how it is automatically done from a large corpus, and how additional semantics are inferred from aggregate statistics. We also briefly discuss the various ways extracted knowledge is used throughout the IBM DeepQA system.
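The second stage described in the abstract, inferring semantics from aggregate statistics over extracted shallow knowledge, can be illustrated with a small sketch. This is not the authors' implementation: the frame format (subject, verb, object triples), the sample data in `FRAMES`, and the choice of pointwise mutual information (PMI) as the statistic are all assumptions made for illustration, and the function names are hypothetical.

```python
import math
from collections import Counter

# Hypothetical stage-1 output: shallow (subject, verb, object) frames.
# A real system would obtain these by parsing a large corpus; the values
# here are made up for illustration.
FRAMES = [
    ("scientist", "publish", "paper"),
    ("scientist", "publish", "paper"),
    ("scientist", "win", "prize"),
    ("author", "publish", "book"),
    ("author", "write", "book"),
    ("author", "write", "book"),
    ("athlete", "win", "medal"),
    ("athlete", "win", "prize"),
]


def aggregate(frames):
    """Count (subject, verb) pairs and the marginal counts needed for PMI."""
    pair = Counter((s, v) for s, v, _ in frames)
    subj = Counter(s for s, _, _ in frames)
    verb = Counter(v for _, v, _ in frames)
    return pair, subj, verb, len(frames)


def pmi(s, v, pair, subj, verb, total):
    """Pointwise mutual information (base 2) between a subject and a verb."""
    joint = pair[(s, v)]
    if joint == 0:
        return float("-inf")  # the pair was never observed together
    return math.log2(joint * total / (subj[s] * verb[v]))


def typical_verbs(s, pair, subj, verb, total):
    """Rank the verbs seen with subject s, most strongly associated first."""
    scored = [
        (v, pmi(s, v, pair, subj, verb, total))
        for (s2, v) in pair
        if s2 == s
    ]
    return sorted(scored, key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    stats = aggregate(FRAMES)
    for verb_name, score in typical_verbs("scientist", *stats):
        print(f"scientist -> {verb_name}: PMI = {score:.2f}")
```

In this toy data, "publish" ranks above "win" for "scientist" because it co-occurs with that subject more often than the marginal counts alone would predict. Association scores of this kind are one way implied semantics could be read off aggregate counts.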

Citations

Journal Article

Introduction to "This is Watson"

David A. Ferrucci

- 01 May 2012 -

IBM Journal of Research and Development

TL;DR: A brief history is provided of the events and ideas that positioned the team to take on the Jeopardy! challenge, build IBM Watson™, and ultimately triumph, and the system's champion-level performance is summarized.

Journal Article

Argumentation Mining: State of the Art and Emerging Trends

Marco Lippi, +1 more

- 30 Mar 2016 -

ACM Transactions on Internet Technology

TL;DR: This survey article introduces argumentation models and methods, reviews existing systems and applications, and discusses challenges and perspectives of this exciting new research area.

Journal Article

Effects of big data analytics and traditional marketing analytics on new product success: A knowledge fusion perspective

Zhenning Xu, +2 more

- 01 May 2016 -

Journal of Business Research

TL;DR: The study suggests that knowledge fusion to improve NPS is not automatic and requires strategic choices to obtain its benefits.

Journal Article

Deep parsing in Watson

M. C. McCord, +2 more

- 01 May 2012 -

IBM Journal of Research and Development

TL;DR: Two deep parsing components, an English Slot Grammar (ESG) parser and a predicate-argument structure (PAS) builder, are described, and their use in a pattern-based relation extraction component of Watson is illustrated.

Journal Article

Blockchain-Powered Parallel Healthcare Systems Based on the ACP Approach

Shuai Wang, +7 more

- 28 Aug 2018 -

IEEE Transactions on Computational Socia...

TL;DR: Emerging blockchain technology is combined with parallel healthcare systems (PHS) by constructing a consortium blockchain that links patients, hospitals, health bureaus, and healthcare communities for comprehensive healthcare data sharing, medical records review, and care auditability.

References

Journal Article

WordNet: An Electronic Lexical Database

Christiane Fellbaum

- 01 Sep 2000 -

Language

TL;DR: Topics presented include the lexical database (nouns in WordNet), a semantic network of English verbs, and applications of WordNet (building semantic concordances).

Proceedings Article

Yago: a core of semantic knowledge

Fabian M. Suchanek, +2 more

TL;DR: YAGO is a light-weight and extensible ontology with high coverage and quality, which includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASWONPRIZE).

Book

English Verb Classes and Alternations: A Preliminary Investigation

Beth Levin

TL;DR: Levin classifies over 3,000 English verbs according to shared meaning and behavior, and examines verb behavior with respect to a wide range of syntactic alternations that reflect verb meaning.

Proceedings Article

The Berkeley FrameNet Project

Collin F. Baker, +2 more

TL;DR: This report will present the project's goals and workflow, and information about the computational tools that have been adapted or created in-house for this work.

Journal Article

DBpedia - A crystallization point for the Web of Data

Christian Bizer, +6 more

- 01 Sep 2009 -

Journal of Web Semantics

TL;DR: The extraction of the DBpedia knowledge base is described, the current status of interlinking DBpedia with other data sources on the Web is discussed, and an overview of applications that facilitate the Web of Data around DBpedia is given.