Knowledge vault: a web-scale approach to probabilistic knowledge fusion

doi:10.1145/2623330.2623623

Proceedings Article

- pp 601-610

TLDR

The Knowledge Vault is a Web-scale probabilistic knowledge base that combines extractions from Web content (obtained via analysis of text, tabular data, page structure, and human annotations) with prior knowledge derived from existing knowledge repositories, and that computes calibrated probabilities of fact correctness.

Abstract:

Recent years have witnessed a proliferation of large-scale knowledge bases, including Wikipedia, Freebase, YAGO, Microsoft's Satori, and Google's Knowledge Graph. To increase the scale even further, we need to explore automatic methods for constructing knowledge bases. Previous approaches have primarily focused on text-based extraction, which can be very noisy. Here we introduce Knowledge Vault, a Web-scale probabilistic knowledge base that combines extractions from Web content (obtained via analysis of text, tabular data, page structure, and human annotations) with prior knowledge derived from existing knowledge repositories. We employ supervised machine learning methods for fusing these distinct information sources. The Knowledge Vault is substantially bigger than any previously published structured knowledge repository, and features a probabilistic inference system that computes calibrated probabilities of fact correctness. We report the results of multiple studies that explore the relative utility of the different information sources and extraction methods.
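The fusion step described in the abstract can be illustrated with a minimal sketch. This is not the paper's actual pipeline: it assumes three hypothetical features per candidate fact (a text-extractor confidence, a table-extractor confidence, and a prior score from an existing knowledge base), uses made-up toy data, and swaps in plain logistic regression as the supervised learner. The names `train_fusion` and `fuse` are illustrative only.

```python
import math


def sigmoid(z):
    """Map a real-valued score to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def train_fusion(rows, labels, lr=0.5, epochs=2000):
    """Fit logistic-regression weights by batch gradient descent.

    rows   -- feature vectors, one per candidate fact (hypothetical features)
    labels -- 1 if the fact is known to be true, 0 otherwise
    """
    n_features = len(rows[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(rows, labels):
            score = bias + sum(w * xi for w, xi in zip(weights, x))
            err = sigmoid(score) - y  # gradient of log loss w.r.t. the score
            for j, xi in enumerate(x):
                grad_w[j] += err * xi
            grad_b += err
        weights = [w - lr * g / len(rows) for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / len(rows)
    return weights, bias


def fuse(x, weights, bias):
    """Return the fused probability that a candidate fact is correct."""
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


# Toy, made-up training data: [text_conf, table_conf, prior_score].
ROWS = [
    [0.9, 0.8, 0.7],
    [0.8, 0.0, 0.6],
    [0.7, 0.9, 0.9],
    [0.2, 0.0, 0.1],
    [0.6, 0.0, 0.05],
    [0.1, 0.3, 0.2],
]
LABELS = [1, 1, 1, 0, 0, 0]

WEIGHTS, BIAS = train_fusion(ROWS, LABELS)

# A fact backed by several extractors and the prior, versus a weakly backed one.
p_supported = fuse([0.85, 0.7, 0.8], WEIGHTS, BIAS)
p_unsupported = fuse([0.3, 0.0, 0.05], WEIGHTS, BIAS)
print(f"supported: {p_supported:.3f}, unsupported: {p_unsupported:.3f}")
```

The outputs lie between 0 and 1, but whether they are calibrated in the abstract's sense would have to be checked against held-out labeled facts, which this sketch does not do.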

Citations

Proceedings Article

Embedding Entities and Relations for Learning and Inference in Knowledge Bases

Bishan Yang, +4 more

TL;DR: It is found that embeddings learned from the bilinear objective are particularly good at capturing relational semantics and that the composition of relations is characterized by matrix multiplication.

Journal Article

Knowledge Graph Embedding: A Survey of Approaches and Applications

Quan Wang, +3 more

- 01 Dec 2017 -

IEEE Transactions on Knowledge and Data ...

TL;DR: This article provides a systematic review of existing knowledge graph embedding techniques, covering both state-of-the-art methods and the latest trends, organized by the type of information used in the embedding task.

Journal Article

A Review of Relational Machine Learning for Knowledge Graphs

Maximilian Nickel, +3 more

TL;DR: This paper provides a review of how statistical models can be “trained” on large knowledge graphs, and then used to predict new facts about the world (which is equivalent to predicting new edges in the graph) and how such statistical models of graphs can be combined with text-based information extraction methods for automatically constructing knowledge graphs from the Web.

Proceedings Article

Complex embeddings for simple link prediction

Théo Trouillon, +4 more

TL;DR: This work makes use of complex valued embeddings to solve the link prediction problem through latent factorization, and uses the Hermitian dot product, the complex counterpart of the standard dot product between real vectors.

Posted Content

Complex Embeddings for Simple Link Prediction

Théo Trouillon, +4 more

- 20 Jun 2016 -

arXiv: Artificial Intelligence

TL;DR: In this article, the authors make use of complex valued embeddings to handle a large variety of binary relations, among them symmetric and antisymmetric relations, and their approach is scalable to large datasets as it remains linear in both space and time.

References

Posted Content

Efficient Estimation of Word Representations in Vector Space

Tomas Mikolov, +3 more

- 16 Jan 2013 -

arXiv: Computation and Language

TL;DR: This paper proposes two novel model architectures for computing continuous vector representations of words from very large data sets. The quality of these representations is measured in a word similarity task, and the results are compared to the previously best performing techniques based on different types of neural networks.

Proceedings Article

Efficient Estimation of Word Representations in Vector Space

Tomas Mikolov, +3 more

TL;DR: Two novel model architectures for computing continuous vector representations of words from very large data sets are proposed and it is shown that these vectors provide state-of-the-art performance on the authors' test set for measuring syntactic and semantic word similarities.

Book Chapter

DBpedia: a nucleus for a web of open data

Sören Auer, +5 more

TL;DR: The extraction of the DBpedia datasets is described, along with how the resulting information is published on the Web for human and machine consumption and how DBpedia could serve as a nucleus for an emerging Web of open data.

Proceedings Article

Freebase: a collaboratively created graph database for structuring human knowledge

Kurt Bollacker, +4 more

TL;DR: MQL provides an easy-to-use object-oriented interface to the tuple data in Freebase and is designed to facilitate the creation of collaborative, Web-based data-oriented applications.

Proceedings Article

Yago: a core of semantic knowledge

Fabian M. Suchanek, +2 more

TL;DR: YAGO is a lightweight and extensible ontology with high coverage and quality, which includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as hasWonPrize).

Related Papers (5)

Freebase: a collaboratively created graph database for structuring human knowledge

Yago: a core of semantic knowledge

Translating Embeddings for Modeling Multi-relational Data

Knowledge graph embedding by translating on hyperplanes

Learning entity and relation embeddings for knowledge graph completion