Open Access Proceedings Article

Multi-modal Bayesian embeddings for learning social knowledge graphs

TL;DR
A multi-modal Bayesian embedding model, GenVector, is proposed to learn latent topics that generate word and network embeddings in a shared latent topic space; the method significantly decreases the error rate of learning social knowledge graphs in an online A/B test with live users.
Abstract
We study the extent to which online social networks can be connected to knowledge bases. The problem is referred to as learning social knowledge graphs. We propose a multi-modal Bayesian embedding model, GenVector, to learn latent topics that generate word embeddings and network embeddings simultaneously. GenVector leverages large-scale unlabeled data with embeddings and represents data of two modalities (i.e., social network users and knowledge concepts) in a shared latent topic space. Experiments on three datasets show that the proposed method clearly outperforms state-of-the-art methods. We then deploy the method on AMiner, an online academic search system, to connect a network of 38,049,189 researchers with a knowledge base of 35,415,011 concepts. Our method significantly decreases the error rate of learning social knowledge graphs in an online A/B test with live users.
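
To make the modeling idea concrete, the following is a minimal generative sketch in the spirit of the abstract: shared latent topics emit embeddings in two modalities, one for knowledge concepts and one for network users. The Gaussian emissions, the noise scale, and all sizes are illustrative assumptions, not the paper's exact specification.

# Illustrative multi-modal topic sketch (assumptions noted above).
import numpy as np

rng = np.random.default_rng(0)

K, D = 5, 16                # latent topics, embedding dimension (illustrative)
N_CONCEPTS, N_USERS = 8, 6  # items observed for one researcher "document"

# Assumption: each topic owns a Gaussian in word-embedding space and
# another in network-embedding space, tying the two modalities together.
word_means = rng.normal(size=(K, D))
net_means = rng.normal(size=(K, D))

theta = rng.dirichlet(np.ones(K))  # researcher-level topic proportions

def emit(means, n):
    # Draw a topic per item, then emit an embedding near that topic's mean.
    z = rng.choice(K, size=n, p=theta)
    return z, means[z] + rng.normal(scale=0.1, size=(n, D))

z_w, concept_embs = emit(word_means, N_CONCEPTS)  # knowledge-concept modality
z_u, user_embs = emit(net_means, N_USERS)         # social-network modality
print(theta.round(2), z_w, z_u)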

Citations
Journal Article (DOI)

Graph embedding techniques, applications, and performance: A survey

TL;DR: A comprehensive and structured analysis of various graph embedding techniques proposed in the literature, and the open-source Python library, named GEM (Graph Embedding Methods, available at https://github.com/palash1992/GEM), which provides all presented algorithms within a unified interface to foster and facilitate research on the topic.
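
As an illustration of the unified interface this summary mentions, here is a usage sketch following the pattern shown in the GEM README; the HOPE class and the learn_embedding signature are taken from that README and may differ across library versions, so treat the exact calls as assumptions to verify against the installed release.

# Hypothetical usage sketch based on the GEM README; verify signatures
# against your installed version of the library.
import networkx as nx
from gem.embedding.hope import HOPE

G = nx.karate_club_graph().to_directed()   # GEM examples use directed graphs

em = HOPE(d=4, beta=0.01)                  # d: embedding dimension
Y, t = em.learn_embedding(graph=G, edge_f=None, is_weighted=False,
                          no_python=True)
print(Y.shape)                             # one 4-dimensional vector per node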
Journal Article (DOI)

A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications

TL;DR: A comprehensive review of the literature on graph embedding that introduces the formal definition of graph embedding and related concepts, and proposes two taxonomies organized around the challenges in different graph embedding problem settings and how existing work addresses them.
Proceedings Article (DOI)

Network Embedding as Matrix Factorization: Unifying DeepWalk, LINE, PTE, and node2vec

TL;DR: A unified matrix factorization framework for skip-gram based network embedding that establishes theoretical connections between DeepWalk, LINE, PTE, node2vec, and the theory of the graph Laplacian; the resulting NetMF method offers significant improvements over DeepWalk and LINE on conventional network mining tasks.
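
The closed-form construction behind this unification is short enough to sketch: build the matrix that DeepWalk implicitly factorizes, take an element-wise truncated logarithm, and factorize it with SVD. The window size T, negative-sample count b, and embedding dimension below are illustrative choices, and the dense NumPy version is only meant for toy graphs.

# NetMF-style sketch: M = (vol(G)/(b*T)) * (sum_{r=1..T} P^r) * D^{-1},
# with P = D^{-1} A, then factorize log(max(M, 1)).
import numpy as np
import networkx as nx

G = nx.karate_club_graph()
A = nx.to_numpy_array(G)
vol = A.sum()                          # volume of the graph
d_inv = 1.0 / A.sum(axis=1)            # inverse degrees
P = d_inv[:, None] * A                 # random-walk transition matrix

T, b, dim = 10, 1, 8                   # window, negatives, embedding size
S = np.zeros_like(A)
Pr = np.eye(len(A))
for _ in range(T):
    Pr = Pr @ P                        # r-step transition probabilities
    S += Pr
M = (vol / (b * T)) * S * d_inv[None, :]

logM = np.log(np.maximum(M, 1.0))      # element-wise truncated logarithm
U, s, _ = np.linalg.svd(logM)
emb = U[:, :dim] * np.sqrt(s[:dim])    # embeddings from a rank-dim SVD
print(emb.shape)                       # (34, 8) for the karate graph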
References
Journal Article (DOI)

Latent Dirichlet Allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models, including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
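
The generative story this summary refers to is compact enough to sketch directly: per-document topic proportions drawn from a Dirichlet, a topic for each token, and a word from that topic's distribution. Vocabulary size and hyperparameters below are illustrative.

# Minimal LDA generative sketch (not inference); illustrative sizes.
import numpy as np

rng = np.random.default_rng(0)
K, V, doc_len = 3, 10, 12                      # topics, vocab size, tokens
alpha, eta = 0.5, 0.1                          # Dirichlet hyperparameters

phi = rng.dirichlet(np.full(V, eta), size=K)   # per-topic word distributions
theta = rng.dirichlet(np.full(K, alpha))       # one document's topic mixture
z = rng.choice(K, size=doc_len, p=theta)       # topic assignment per token
words = [rng.choice(V, p=phi[k]) for k in z]   # word drawn from its topic
print(z, words)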
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: Using the Skip-gram model to learn distributed vector representations that capture a large number of precise syntactic and semantic word relationships, this paper presents a simple method for finding phrases in text, shows that learning good vector representations for millions of phrases is possible, and describes a simple alternative to the hierarchical softmax called negative sampling.
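
The negative-sampling objective named above is easy to write down for a single (center, context) pair scored against a few sampled negatives; the random vectors below stand in for parameters that training would update by gradient descent.

# Skip-gram negative sampling, one training pair:
#   loss = -log sigma(u_ctx . v_c) - sum_neg log sigma(-u_neg . v_c)
import numpy as np

rng = np.random.default_rng(0)
dim, n_neg = 8, 5
v_center = rng.normal(size=dim)            # "input" vector of center word
u_context = rng.normal(size=dim)           # "output" vector of true context
u_negs = rng.normal(size=(n_neg, dim))     # vectors of sampled negatives

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

loss = -(np.log(sigmoid(u_context @ v_center))
         + np.log(sigmoid(-(u_negs @ v_center))).sum())
print(loss)                                # lower is better for this pair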
Proceedings Article (DOI)

DeepWalk: online learning of social representations

TL;DR: DeepWalk uses local information obtained from truncated random walks to learn latent representations by treating walks as the equivalent of sentences; the learned representations encode social relations in a continuous vector space that is easily exploited by statistical models.
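
A sketch of DeepWalk's first stage as summarized above: truncated random walks turn a graph into node "sentences", which a skip-gram model (such as the negative-sampling sketch earlier) would then embed. Walk length and walks per node are illustrative defaults.

# Generate truncated random walks; feeding them to any skip-gram
# implementation yields DeepWalk-style node embeddings.
import random
import networkx as nx

G = nx.karate_club_graph()

def random_walk(g, start, length=10):
    walk = [start]
    while len(walk) < length:
        nbrs = list(g.neighbors(walk[-1]))
        if not nbrs:
            break                          # dead end: truncate the walk
        walk.append(random.choice(nbrs))
    return [str(n) for n in walk]          # node ids as "words"

random.seed(0)
corpus = [random_walk(G, n) for n in G.nodes() for _ in range(5)]
print(corpus[0])                           # one node "sentence"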